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MaxInsertions={enter value here} 208 
OutputToScreen=yes/no 210 
0utputToFile=yes/no 212 
MinimumAccepted={enter value here} 214 
MaxDuplicateFunctionValues={enter value here} 
MaxSearchTime (min.)={enter value here} 218 
Exhaustive=yes/no 220 
NumStochasticProbes={enter value here} 222 
MaxHitsPerProbe={enter value here} 224 
RandomProbeStart=yes/no 226 

FIG.1 1B 
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OutputToScreen = No 
OutputToFile = Yes 
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SeorchTime = 5 
NumStochosticProbes = 10 
MaxHitsPerProbe = 25 
RandomProbeStort = Yes 



Col. 1 


Col. 2 


Col. 3 


Col. 4 


Col. 5 


Col. 6 


Col. 7 


Col. 8 


Col. 9 


Col. 10 


Col. 11 


Code 1 


11 
1 1 


12 


13 

1%J 


14 


Code 2 


c 


N 


C+N 


j 

V 


MoxFunc 


1 

A 


c 


A 




L 


a 
D 


0 AA 


O OA 

z.zU 


A Af\ 

4.4U 


U 


Q OA 

o.oU 


A 


p 

L 






L . 


c 
I 


O AA 

z.uu 


O OA 

z.A) 


A A(\ 

4.4U 


U 


Q OA 

o.oU 


A 


C 






L 


0 


2.00 


2.20 


4.40 


0 


8.80 


A 


C 






L 


E 


2.00 


2.20 


4.40 


0 


8.80 


A 


c 






R 


F 


2.00 


1.57 


3.14 


2 


1.57 


A 


c 






R 


G 


2.00 


1.57 


3.14 


1 


3.14 


A 


c 






R 


H 


2.00 


1.57 


3.14 


0 


6.28 


A 


G 








1 


1.80 


1.33 


2.39 


1 


2.39 


A 


c 


A 


A 


G 


J 


2.00 


1.33 


2.66 


0 


5.32 


A 


c 






R 


K 


2.00 


1.57 


3.14 


0 


6.28 


B 


c 


A 


A 


G 


A 


2.00 


1.33 


2.66 


0 


5.32 


8 


C 


A 




R 


C 


2.00 


1.57 


3.14 


0 


6.28 


8 


C 


A 




R 


0 


2.00 


1.57 


3.14 


0 


6.28 


8 


C 


A 




R 


E 


2.00 


1.57 


3.14 


0 


6.28 ! 


B 


C 


A 


A 


G 


F 


2.00 


1.33 


2.66 


1 


2.66 


B 


c 






R 


G 


2.00 


1.57 


3.14 


1 


3.14 


B 


c 






R 


H 


2.00 


1.57 


3.14 


0 


6.28 


B 


c 


A 


A 


G 


1 


2.00 


1.33 


2.66 


1 


2.66 


B 


. c 


A 


A 


G 


J 


2.00 


1.33 


2.66 


0 


5.32 


B 


c 


A 


A 


G 


K 


2.00 


1.33 


2.66 


0 


5.32 


C 


c 


A 




R 


A 


2.00 


1.57 


3.14 


1 


3.14 


C 


c 


A 




R 


B 


2.00 


1.57 


3.14 


1 


3.14 


C 


c 






L 


D 


2.00 


2.20 


4.40 


1 


4.40 


C 


c 


A 




R 


E 


2.00 


1.57 


3.14 


1 


3.14 


C 


c 






R 


F 


2.00 


1.57 


3.14 


1 


3.14 


C 


c 






R 


G 


2.00 


1.57 


3.14 


1 


3.14 


C 


c 






R 


H 


2.00 


1.57 


3.14 


0 


6.28 


C 


c 


A 




R 


1 


2.00 


1.57 


3.14 


1 


3.14 


C 


c 


A 


A 


R 


J 


2.00 


1.57 


3.14 


0 


6.28 


C 


c 


A 


A 


R 


K 


2.00 


1.57 


3.14 


0 


6.28 



FIG.13B 



Sheet 20 of 90 

AppL No. 10/677,754; Filed: Oct 3, 2003 

Dkt No. 2060.0200003/HCC/PAC; Group Unit: To be 

assigned 
Inventors: Shi lb et al. 
Tel. No.: 202-371-2600 

For Optimized MuW-Epitope Constructs and Uses Thereof 

20/90 



Code 1 


11 


12 


13 


14 


Code 2 


C 


N 


C+N 


J 


MoxFunc 


! 


pi 
J 


A 
b 






L 


A 
A 




0 OA 

Z.ZU 


4.4U 


A 
U 


Q QA 

o.oU 


1 


J 


A 
b 






i 

L 


n 
□ 


z.UU 


0 OA 

Z.ZU 


A A(\ 

4.4U 


A 
U 


Q QA 

o.oU 


1 


J 


A 

b 






i 

L 


a 

b ■ 


O AA 
Z.UU 


0 OA 

Z.ZU 


A A(\ 

4.4U 


A 
U 


Q QA 

o.oU 


1 


J 


a 
b 






i 

L 


r 

L 


O AA 
Z.UU 


0 OA 

Z.ZU 


A A(\ 

4.4U 


A 
U 


Q QA 

o.oU 


1 
1 


J 


a 
b 








r 

r 


1 QA 

l.oU 


1 TX 
l.JJ 


O 70. 

z.jy 


A 
U 


A 70 

4./9 


1 
1 


J 


a 

b 






K 


b 


O AA 
Z.UU 


1.3/ 


X 1A 

J.14 


A 
U 


C OQ 

D.Zo 


1 
1 


J 


a 
b 


A 
A 


A 
A 


a 
b 


H 


O AA 
Z.UU 


1.00 


0 cc 
Z.00 


A 
U 


C 70 
J.JZ 




J 


a 
b 






1 

L 


i 
1 


O AA 
Z.UU 


0 OA 

Z.ZU 


A Af\ 

4.4U 


1 


A 4A 

4.4U 




J 


a 
b 


A 

A 




b 


i 

J 


O AA 
Z.UU 


1 XX 

l.JJ 


O ££ 

Z.00 


A 
U 


£ 70 
J.JZ 


1 


) 


a 

c 






K 


K 


0 AA 
Z.UU 


1.3/ 


7 ^A 
J.14 


A 
U 


C OQ 

D.Zo 






A 

I 


A 


A 

A 


L 


A 

A 


O AA 
Z.UU 


O OA 

Z.zU 


A Af\ 

4.4U 


A 
U 


Q QA 

o.oU 






c 


A 


A 

A 


L 


n 
D 


O AA 
Z.UU 


O OA 

Z.ZU 


A Af\ 

4.4U 


A 
U 


O OA 

o.oU 






c 


A 


A 

A 


L 


I 


O AA 
Z.UU 


O OA 

Z.zU 


A Af\ 

4.4U 


A 
U 


O OA 

o.oU 






c 


A 

A 


A 

A 


i 

L 


D 


O AA 
Z.UU 


O OA 

Z.zU 


A AC\ 

4.4U 


A 
U 


O OA 

o.oO 






c 


A 

A 




n 
R 


r 

r 


1 AA 
Z.UU 


1 til 

1.3/ 


7 1 A 

j.14 


A 
U 


C OO 

6.28 






c 


A 

A 




n 

R 


G 


O AA 
Z.UU 


1.3/ 


7 1 A 

3.14 


A 
U 


c oo 

0.28 






c 


A 

A 




R 


H 


1 AA 

z.UU 


1 C7 

1.3/ 


7 1 A 

3.14 


A 

0 


c oo 

6.28 






c 


A 

A 


A 

A 


i 

L 


1 


O AA 

z.UU 


O OA 

2.2U 


A Af\ 

4.4U 


A 

0 


O OA 

8.80 






c 


A 

A 




R 


J 


O AA 

z.UU 


1 C7 

1.3/ 


7 1 A 

3.14 


A 

0 


c oo 

6.28 






c 


A 




n 

R 


K 


2.00 


1.3/ 


7 1 A 

3.14 


0 


C AO 

6.28 






K 






L 


A 

A 


O AA 

2.20 


A AA 

2.20 


i o A 

4.84 


1 


A O A 

4.84 






1/ 

K 


A 

A 


A 

A 


G 


B 


A AA 

2.20 


1.33 


A A7 

2.93 


1 


A A7 

2.93 






K 


A 

A 


A 

A 


G 


C 


A AA 

2.20 


1 77 

1.33 


A A7 

2.93 


A 

0 


5.85 






K 


A 

A 


A 

A 


G 


D 


A AA 

2.20 


1 77 

1.33 


A A7 

2.93 


A 

0 


c oc 

5.85 






K 


A 


A 

A 


G 


E 


A AA 

2.20 


1.33 


A A 7 

2.93 


0 


5.85 






K 


A 




G 


G 


A AA 

2.20 


1 77 

1.33 


A A7 

2.93 


1 


O A7 

2.93 






K 


A 

A 




G 


• i 
H 


A AA 

2.20 


1 77 

1.33 


A A7 

2.93 


1 


O A7 

2.93 






K 


A 

A 


A 

A 


G 


1 


A AA 

2.20 


1 77 

1.33 


A A7 

2.93 


4 
1 


O A7 

2.93 






K 






R 


i 

J 


A AA 

2.20 


1.3/ 


7 K 

3.45 


4 
1 


7 4C 

3.45 






K 






R 


K 


A AA 

2.20 


1.57 


7 AH 

3.45 


A 

0 


C A1 

6.91 


1 


J 


C 


A 

A 




R 


A 

A 


A AA 

2.00 


1.3/ 


7 14 

3.14 


l 


7 14 

3.14 


1 


J 


c 


A 

A 




R 


n 

B 


A AA 

2.00 


1.3/ 


7 14 

3.14 


A 

2 


1 C7 

1.57 


1 


r* 
J 


c 


A 

A 




R 


C 


A AA 

2.00 


1 C7 

1.3/ 


7 14 

3.14 


4 
I 


7 14 

3.14 


j 
1 


r* 
J 


c 






L 


D 


A AA 

2.00 


A OA 

2.20 


A A(\ 

4.4U 


1 
I 


A 4A 

4.40 


1 


J 


c 


A 

A 




R 


r 

E 


A AA 

2.00 


1 C7 

1.3/ 


7 14 

3.14 


o 
Z 


1 C7 

1.5/ 




r\ 
J 


c 






L 


r 

F 


A AA 

2.00 


O OA 

2.20 


A 4A 

4.4U 


4 


1 1 A 

1.10 


i 




c 






G 


u 

H 


A AA 

2.00 


1 77 

1.33 


o cc 

2.66 


A 

0 


C 70 

5.32 


i 




c 


A 

A 


A 

A 


R 


1 


A AA 

2.00 


1 C7 

1.3/ 


7 14 

3.14 


A 

2 


1 C7 

1.5/ 


i 




c 


A 


A 

A 


R 


■ 

j 


A AA 

2.00 


1 C7 

1.3/ 


7 14 

3.14 


4 

! 


7 14 

3.14 


i 




c 


A 


A 


R 


K 


2.00 


1.57 


3.14 


0 


6.28 




H 


c 


A 


A 


G 


A 


2.00 


1.33 


2.66 


0 


5.32 




H 


c 


A 


A 


G 


B 


2.00 


1.33 


2.66 


1 


2.66 




H 


c 


A 




G 


C 


2.00 


1.33 


2.66 


0 


5.32 




H 




A 




G 


0 


2.00 


1.33 


2.66 


0 


5.32 


i 


H 


•I 


A 


A 


G 


E 


2.00 


1.33 


2.66 


0 


5.32 




H 


c 


A 


A 


G 


f 


2.00 


1.33 


2.66 


1 


2.66 


i 


H 


c 






R 


G 


2.00 


1.57 


3.14 


1 


3.14 


i 


H 


c 


A 


A 


G 


1 


2.00 


1.33 


Z66 


1 


2.66 




H 


c 


A 




G 


J 


2.00 


1.33 


2.66 


1 


2.66 




H 


c 


A 


A 


G 


K 


2.00 


1.33 


2.66 


0 


5.32 



FIG.13C 



Sheet 21 of 90 

AppL No. 10/677,754; Filed: Oct 3, 2003 

Diet No. 2060.0200003/HCC/PAC; Group Unit: To be 

assigned 
Inventors: SETTE et al 
Tel. Na: 202-371-2600 

For. Optimized Multi- Epitope Constructs and Uses 
Thereof 



21/90 



Code 1 


11 


12 


13 


14 


Code 2 


C 


N 


C+N 


J 


MoxFunc 


1 


K 


A 


A 


G 


A 


220 


1 33 


2 93 


U 


iJ.UJ 


1 

i 


K 


A 

n 


A 

n 


G 


R 
u 


2 20 


1 33 


2 93 


1 
1 


9Q3 


1 
i 


K 


A 




G 


f 


2 20 


1 33 


2Q3 


n 

V 


J.OJ 


1 
i 


K 


A 

n 




G 


n 

u 


2 20 


1 33 


293 


n 

u 


J.OJ 


1 
i 


K 


A 


A 


G 


r 

L 


2 20 


1 33 


2Q3 


n 

u 


J.OJ 


1 
i 


K 

rv 


A 


A 

n 


G 


r 
i 


2 20 

L.L\J 


1 33 


2 93 


1 

1 


293 


1 
I 


K 






R 


G 


2 20 


1 S7 


3 4S 
j.tj 


1 

1 


3 AS 

J.tJ 


1 
I 


K 
rv 


A 

n 


A 


G 


U 

n 


2 20 


1 33 
i . j j 


2Q3 

Z.J J 


n 

u 


SflS 

J.OJ ; 


1 

i 


V 

r\ 


A 

n 




G 


1 

0 


2 20 


1 33 


9Q3 

Z.JJ 


1 

1 


2Q3 


i 


r\ 


A 

n 


A 

n 


G 


K 
r\ 


2 20 


1 33 


9Q3 
z.aJ 


u 


SflS 

J.OJ 




1/ 

r\ 


A 
n 


A 

r\ 


p 


A 

n 


2 20 
z.zu 


1 S7 


3 AS 

J.tJ 


n 
u 




i 


K 


A 
n 


A 
n 


p 


D 
D 


9 90 
z.zu 


1 S7 


AS 

J.tJ 


1 
s 


J.tJ 


i 


V 

r\ 


A 
r\ 




p 


p 


9 90 
z.zu 


1 S7 


J.tJ 


u 


KQ1 

o.y 1 


I 


K 


A 
n 




P 


n 


9 90 

Z.ZU 


1 S7 
i .j/ 


3 AS 

J.tJ 


n 
u 




i 




A 
M 




p 


F 

L 


9 90 
z.zu 


1 S7 


3 AS 
J.tJ 


i 
i 


\ AS 

J.tJ 


j 


K 


A 


A 


R 


F 


2.20 


1.57 


3.45 


2 


1.73 




K 






R 


G 


2.20 


1.57 


3.45 


1 


3.45 




K 






R 


H 


2.20 


1.57 


3.45 


0 


6.91 




K 


A 


A 


R 


1 


2.20 


1.57 


3.45 


1 


3.45 




K 


A 


A 


R 


K 


2.20 


1.57 


3.45 


0 


6.91 


K 


K 






L 


A 


2.20 


2.20 


4.84 


0 


9.68 


K 


K 






L 


B 


2.20 


2.20 


4.84 


0 


9.68 


K 


K 






L 


C 


2.20 


2.20 


4.84 


0 


9.68 


K 


K 






L 


D 


2.20 


2.20 


4.84 


0 


9.68 


K 


K 


A 


A 


L 


E 


2.20 


2.20 


4.84 


0 


9.68 


K 


K 


A 


A 


R 


F 


2.20 


1.57 


3.45 


1 


3.45 


K ' 


G 








G 


1.80 


1.33 


2.39 


0 


4.79 


K 


K 






R 


H 


2.20 


1.57 


3.45 


0 


6.91 


K 


K 






L 


1 


2.20 


2.20 


4.84 


1 


4.84 


K 


K 






R 


J 


2.20 


1.57 


3.45 


0 


6.91 



Junctional Analyzer took 142.77 seconds. 



FIG.13D 



Sheet 22 of 90 

AppL No. 10/677.754; Filed: Oct 3, 2003 

Dkt No. 2060.0200003/HCC/PAC; Group Unit: To be 



Inventors: SETTE et al. 
Tel. No.: 202-371-2600 

For. Optimized Multi- Epitope Constructs and Uses Thereof 



SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS3 8 




22/90 



EEESEE5ESSESS5S5E 



S3 

CXI 



CM 



^1 



Csl 
O 

o 
* 

CD 

i 



O 
CD 



CNI 



K\SSSSSSSSS\SSSS\SS 



o 
o 

I 



< 



CNJ 
CVI 



I CO 



2 £ *5 



lssssssssssssssssssss: o 



S8 

<s 



1000 
100 



L 



Sheet 23 of 90 

AppL No. 10/677,754; Filed: Oct 3, 2003 

Dkt No. 2O60.O20OO03/HCC/PAC; Group Unit To be 

assigned 
Inventors: St lib et al. 
Tel. No.: 202-371-2600 

For Optimized Multi- Epitope Constructs and Uses Thereof 



Responses to PfCTLI 



23/90 

peptide/IFA 
DNA 



100„ 



CO 



CO 
<D 
CO 



CO 




I 



SSP2-230 EXP1-13 
HLA-A*0201 



CSP-7 



i 



i 



i 



1 



LSA1-94 LSA1-105 SSP2-522 
HLA-AM101 



SSP 2-539 
HLA-B*0702 



1000-, 



FIG.14B-1 

Responses to PfCTL2 




5416 SU 



I 



i 



SSP2-14 EXP1-50 
HLA-A*0201 



EXP1-91 



SSP2-523 EXP1-10 LSA1-11 
HW-AM101 



SSP2-126 
HLA-B*0702 



Sheet 24 of 90 

AppL No. 10/677,754; Filed: Oct 3, 2003 

Dirt No. 2060.0200003/HCC/PAC; Group Unit To be 

assigned 
Inventors: St l i b ct al. 
Tel. No.: 202-371-2600 

For Optimized Multi-Epitope Constructs and Uses 
Thereof 

24/90 




Sheet 25 of 90 

Appl. No. 10/677.754; Filed: Oct 3, 2003 

Dkt No. 2060.0200003/HCC/PAC; Group Unit: To be 

assigned 
Inventors: SETTE et a). 
Tel. No.: 202-371-2600 

For: Optimized Multi-Epitope Constructs and Uses Thereof 

25/90 



EZ1 60mer polypeptide (- GPGPG spacers) 
^ 75mer polypeptide (+ GPGPG spacer) ' 




Pol 711 Gag 171 Pol 335 Pol 303 

Epitope 

FIG. 15 



Sheet 26 of 90 

Appl. No. 10/677,754; Filed: Oct 3, 2003 

Dkt No. 2060.0200003/HCC/PAC; Group Unit: To be 



Inventors: SETTE et al. 
Tel. No.: 202-371-2600 

For Optimized Multi- Epitope Constructs and Uses Thereof 



26/90 




(tOO ft 0l/0JS) asuodsai tih 



Sheet 27 of 90 

Appl. No. 10/677,754; Filed: Oct 3, 2003 

Dkt No. 2060.0200003/HCC/PAC; Group Unit: To be 



Inventors: SETTE et al. 
Tel. No.: 202-371-2600 

For Optimized Multi-Epitope Constructs and Uses 



Thereof 



27/90 



OQ.OCLO 



Q-fO 



OQ-O Q-O 



E 

to 



o 



OQ-OQ-O 



CD 

T 



2§ 



OQ-OO-O 



28 



OCLOQ-O 



o2 
Q- to 



OQ-OQ-O 



2g 



O Q-OQ-O 



Q_ to 



O Q-OQ-O 



lO 

O ro 
Q_ to 



O Q-OQ-O 



Q- CT> 



O Q-OQ-O 



o r-« 

Q_ OO 



OQ-OQlO 



> to 

£8 



O Q-OQ-O 



o r-* 
o *- 



O Q-OQ-O 



O CM <N 



O Q-OQ-O 



O Q-OQ-O 



,Q.fO 



O Q-O Q-O 



— <o 
o o» 

Q- m 



O Q-OQ-O 



— to 
o Q-O Q-O 



i 

i 



GET 

2 



O Q-O Q-O 



©O 
Q-fO 



OQ-O Q-O 



O- 



OQ-O Q-O 



o2 

Q- to 



OQ-O Q-O 



_-CO 



OQ-O Q-O 



Q- co 



OQ-O Q-O 



— lO 
O to 

Q- ro 



O Q-O Q-O 



oif2 

Q- 



OQ-OO-O 



O Q-OQ-O 



> to 
C to 



O Q-OQ-O 



OQ-O Q-O 



CO 
CSI CM 



OQ-OO-O 



> cn 



O Q-OQ-O 



^ to 



OQ-O Q-O 



— to 
© cn 

Q- iT) 



O Q-O Q-O 



to 

o tO 
Q- o> 

OQ-O Q-O 



Sheet 28 of 90 

Appl. No. 10/677.754; Filed: Oct 3, 2003 

Dkt No. 2060.0200003/HCC/PAC; Group Unit: To be 

assigned 
Inventors: SETTE et aL 
Tel. No.: 202-371-2600 

For Optimized Mulli-Epitopc Constructs and Uses Thereof 

28/90 

EP-HIV-1090 

M6MQVQIQSLFLLLLWVPGSRGKLVGKLNWAGAAILKEPVHGVNAACPKVSFEPIKIPIHYCAPAKAKFVAAW 
TLKAAAKAFPVRPQVPLWKLTPLC\rTLGAMVU\EAMSQVKVYUWVPAHKGAAM 
SDAKNIPYNPQSQGVVKHPVHAGPIANVTVYYGVPVWKKAAAQMAVFIHNFKNAAAYPLASLRSLFNLTFGWC 
FKLNRILQQLLFINAKIQNFRVYYRKAAVTIKIGGQLKKVPLQLPPLKAMTNNPPIPV 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAAAGCTGG 
TGGGCAAACTCAACTGGGCCGGAGCTGCAATCCTGAAGGAGCCCGTCCACGGGGTGAATGCCGCTTGCCCTAA 
AGTCAGCTTCGMCCMnMGATCCCCAnCAnACTGTGCACCTGCCAMGCTMGTTTGTGGCCGCTTGG 
ACCCTCAAGGCCGCTGCAAAAGCCTTCCCAGTGAGGCCCCAGGTGCCTCTGGGCGCCGCTAAACTCACACCAC 
TGTGCGTCACTCTGGGAGCCGCTGCAGTGCTGGCAGAGGCCATGTCCCAAGTGAAGGTGTATCTGGCTTGGGT 
GCCCGCCCACMGGGGGCCGCTGCAGCCATCTTTCAGTCTAGCATGACCAAGAAAACAACTCTGTTCTGTGCC 
TCCGACGCTAAGAACATCCCTTATAATCCACAGTCTCAGGGCGTGGTCAAGCATCCCGTGCACGCCGGACCTA 
TTGCTAACGTGACCGTGTACTATGGGGTCCCAGTGTGGAAGAAAGCCGCTGCACAGATGGCCGTGTTTATTCA 
CMTTTCAAAAACGCCGCTGCATACCCCCTCGCCAGCCTGAGATCCCTCTTCAACCTGACATTCGGCTGGTGC 
TnMGCTGMCCGGATCCTGCAGCMCTGCTCTTTATCMTGCTAAAATCCAGAACTTCCGCGTCTACTATA 
GGAAGGCTGCAGTGACTATCAAAATTGGCGGACAACTGAAGAAAGTGCCTCTCCAGCTGCCCCCTCTCAAGGC 
AATGACCAACAATCCCCCTATCCCAGTCTGA 

HIV-CPT 

MGMQVQIQSLFLLLLWVPGSRGIPIHYCAPAKAAKIQNFRVYYRI^^ 

PLQLPPLKAIFQSSMTKKLTPLCVTLGAQMAVFIHNFKGAKVYUWVPAHKNAIPYNPQSQGVVKAILKEPVH 
GVGAMLTFGWCFKL^VUEAMSQVNRILQQLLFINAAACPKVSFEPIKVTVYYGVPVWKKAAHPVHAGPIA 
NAAAYPLASLRSLFNAAATTLFCASDAKNKLVGKLNWANAAAFPVRPQVPLNMTNNPPIPV 

ATGGGGATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAATCCCCA 
TTCACTACTGCGCCCCTGCTAAGGCAGCCAAAATCCAGAACTTCAGGGTGTATTACAGAAAGGCTGCAGTCAC 
CAnAAMTCGGCGGACMCTGMGAAAGCCAAGTTTGTGGCCGCTTGGACACTCAAGGCCGCTGCAAAGGTC 
CCACTGCAGCTCCCCCCTCTGAAGGCCATCTTCCAGAGCTCCATGACTAAGAAACTGACCCCACTGTGTGTGA 
CACTCGGGGCCCAGATGGCTGTGnCATCCATMTTTTAAAGGCGCCAAGGTCTACCTGGCTTGGGTGCCCGC 
ACACAAGAACGCCATTCCTTACAATCCACAGTCTCAAGGAGTGGTCAAAGCTATTCTGAAGGAGCCCGTGCAC 
GGGGTGGGCGCCGCTGCACTCACTTTCGGATGGTGCTTTAAACTGAACGCCGTGCTGGCTGAAGCCATGAGCC 
AGGTCAATCGGATCCTGCAGCAACTGCTCTTCATTAACGCCGCTGCATGTCCTAAGGTGTCCTTCGAGCCAAT 
CAAAGTGACCGTGTATTACGGGGTCCCCGTGTGGAAGAAAGCCGCTCATCCTGTCCACGCAGGCCCAATCGCC 
MCGCCGCTGCATATCCCCTCGCCTCTCTGCGCAGCCTGTTTMCGCCGCTGCMCMCCCTCTTTTGCGCCT 
CCGACGCTAAGAATAAACTGGTGGGAAAGCTGAACTGGGCCAACGCAGCTGCATTCCCTGTGAGGCCACAGGT 
CCCCCTCAATATGACTAACAATCCCCCTATCCCAGTGTGA 
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HIV-FT 

MQVQIQSLFLLLLWVPGSRGKLVGKLNWAMASDFNLPPVAIFQSSMTKVTIKIGGQLKRILQQLLFIMAVFIH 
NFKIPYNPQSQGVVTTLFCASDAKILKEPVHGVQMAVFIHNFKGAAVFIHNFKRCPKVSFEPIKIQNFRVYYR 
LTFGWCFKLQVPLRPMTYKMTNNPPIPVTVYYGVPVWKVLAEAMSQVIPIHYCAPAKLTPLCVTL 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAAAGCTGGTGGGGA 
AGCTGAACTGGGCCATGGCCAGCGATTTCAACCTGCCCCCCGTGGCCATCTTCCAGAGCAGCATGACCAAGGT 
GACCATCAAGATCGGGGGGCAGCTGAAGAGGATCCTGCAGCAGCTGCTGTTCATCATGGCCGTGTTCATCCAC 
AACTTCAAGATCCCCTACAACCeCCAGAGCCAGGGGGTGGTGACCACCCTGTTCTGCGCCAGCGATGCCAAGA 
TCCTGAAGGAGCCCGTGCACGGGGTGCAGATGGCCGTGTTCATCCACAACTTCAAGGGCGCCGCCGTGTTCAT 
CCACAACTTCAAGAGGTGCCCCAAGGTGAGCTTCGAGCCCATCAAGATCCAGAACTTCAGGGTGTACTACAGG 
CTGACCTTCGGGTGGTGCTTCAAGCTGCAGGTGCCCCTGAGGCCCATGACCTACAAGATGACCAACAACCCCC 
CCATCCCCGTGACCGTGTACTACGGGGTGCCCGTGTGGAAGGTGCTGGCCGAGGCCATGAGCCAGGTGATCCC 
CATCCACTACTGCGCCCCCGCCAAGCTGACCCCCCTGTGCGTGACCCTG 
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HIV-TC 

MGMQVQIQSLFLLLLWVPGSRGYWQATWIPEWKAIFQSSMTKKVYLAWVPAHKNAACPKVSFEPIKHPVHAGP 

IANLTFGWCFKLNKMIGGIGGFIKFRDYVDRFYKAAARILQQLLFINTTLFCASDAKNQMVHQAISPRGAKLV 

GKLNWAGAAAI YETYGDTWKAAQVPLRPMTYKGAAAVTVLDVGDAYNAAARYLKDQQLLNTLNFP I SP I NMTN 

NPPIPVNAPYNTPVFAIKAMVPLQLPPLKMIPYNPQSQGVVKALLQLTVWGIGMILKEPVHGVNAAAFPI 

SPIETVKVWKEATTTLFKAMVTIKIGGQLKKIYQEPFM^ 

TDSQYKAMIPIHYC/^AKAVIYQYMDDLYKAMQMAVFIHNFKNMTYQIYQEPFKPYNE 

RVYYRKAFPVRPQVPLGAAAIWGCSGKLIKVMIVWQVDRNMKMCWWAGIKAKFVMWTLKAMKLTP^ 

LNAAMASDFNLPP VKS LLNATD I AVNVTVYYG VP VWKKAAAA I IRI LQQLKRAMASDFNLNAAAYPLASLRSL 

F 

ATGGGGATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCTAGAGGATACTGGC 
MGCTACTTGGATTCCAGMTGGAMGCTATCTTTCAATCCTCAATGACGAAGAAGGTATACCTGGCATGGGT 
CCCAGCACACAAGAACGCCGCTTGCCCAAAGGTGTCCTTTGAACCCATTAAACACCCAGTGCACGCAGGGCCA 
ATAGCGAATTTGACAnCGGGTGGTGCnCAMCTAMCAAMTGATCGGCGGCATTGGAGGCTTTATCAAGT 
TTAGAGATTACGTGGACCGATTCTATAAAGCCGCTGCCCGTATACTCCAGCAGCTACTATTCATCAACACCAC 
TCTCTTCTGCGCTTCAGACGCTAAGAACCAAATGGTACACCAAGCCATAAGCCCTAGAGGAGCCAAGCTCGTA 
GGGAMnAMTTGGGCGGGTGCAGCAGCAATCTACGAGACTTACGGCGATACCTGGAAAGCAGCCCAGGTTC 
CGTTACGCCCAATGACCTATAAAGGCGCAGCAGCAGTAACAGTTCTAGATGTAGGAGACGCTTACAACGCTGC 
CGCMGATACCTAAMGATCAGCAGTTACTCMCACACTAMTTTCCCAATTAGCCCGATAAACATGACAAAT 
AACCCACCAATTCCCGTCAATGCTCCCTACAACACTCCAGTATTCGCAATCAAAGCCGCTGCTGTCCCCCTGC 
AGCTCCCTCCTCTGAAAGCTGCGATACCTTACAACCCACAGAGCCAAGGTGTTGTCAAAGCACTGCTTCAGCT 
MCAGTnGGGGMnGGTGCTGCMTTCTAAMGAGCCAGTTCATGGGGTTAACGCCGCCGCCTTCCCAATC 
AGTCCTATTGAGACTGTGAMGTATGGAMGMGCCACMCCACACTTTTTAAGGCAGCCGCAGTTACAATTA 
AMTAGGGGGCCMCTTMGAAMTATACCAGGMCCTnCAAGAATCtCAAAGCCGCTGCAGTGCTCGCCGA 
GGCTATGTCACAGGTGAATTTGGTCGGACCAACACCCGTAAACATCGGAGCCGCAGCCGAAGTGAACATAGTC 
ACCGACTCACAGTACAAAGCCGCTGCAATACCCATACATTATTGTGCTCCCGCAAAGGCCGTGATCTATCAAT 
ATATGGACGACCTGTATMGGCCGCCGCGCAGATGGCAGTCTTTATCCACMCTTTAAAAACGCAGCTACTTA 
TCAGATCTACCAGGMCCATTCAMCCGTACMTGAGTGGACCTTGGMCTAMGGCCAAMTTCAGAACTTC 
AGGGTATATTATAGAAAAGCATTTCCAGTGAGGCCCCAGGTGCCTCTGGGTGCCGCAGCAATATGGGGATGTT 
CTGGAAAACTGATCAAGGTGATGATTGTATGGCAAGTGGACAGAAATGCAGCTAAGGCAGCCTGTTGGTGGGC 
AGGTATAAAAGCAAAGTTCGTGGCAGCATGGACGCTTAAAGCAGCCGCAAAACTCACTCCTCTCTGCGTGACA 
CnMTGCAGCCATGGCCTCTGATTTCAACCTTCCCCCTGTAAAATCCCTGCTTAATGCGACAGATATCGCAG 
TCMCGTMCAGTATATTATGGCGTGCCAGTCTGGAAAAAAGCCGCCGCGGCCATAATTCGGATACTGCAGCA 
GCTGAAMGAGCTATGGCGAGTGACTTCMCCTGMTGCGGCCGCCTACCCCTTGGCATCGTTAAGGTCACTA 
TTTTGA 
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HCV.l 

MGMQVQIQSLFLLLLWVPGSRGLLFNIL6GWVDLMGYIPLVYLVAYQATVILAGYGAGVRLIVFPDLGVHMWNFISGI 
YLLPRRGPRLYLVTRHADVVLVGGVLAALLFLLLADAFLLLADARVWMNRLIAFACTCGSSDLYLSAFSLHSYGVAGA 
LVAFKLPGCSFSIFKTSERSQPRLIFCHSKKKFWAKHMWNFIPFYGKAIRMYVGGVEHRQLFTFSPRRRLGVRATRKV 
GIYLLPNRAKFVAAWTLKAAA* 

GAATTCGCCGCCACCATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGACTG 

CTGTTCAACATCCTGGGGGGGTGGGTGGATCTGATGGGGTACATCCCCCTGGTGTACCTGGTGGCCTACCAGGCCACC 

GTGATCCTGGCCGGGTACGGGGCCGGGGTGAGGCTGATCGTGTTCCCCGATCTGGGGGTGCACATGTGGAACTTCATC 

AGCGGGATCTACCTGCTGCCCAGGAGAGGACCTAGACTGTACCTGGTGACTAGACACGCTGATGTGGTGCTGGTGGGA 

GGAGTGCTGGCTGCTCTGCTGTTTCTGCTGCTGGCTGATGCTTTCCTGCTGCTGGCTGATGCTAGAGTGTGGATGAAC 

AGACTGATCGCTnCGCTTGTACATGTGGAAGCTCCGATCTGTATCTGAGCGCTTTGAGCCTGCACAGCTACGGAGTG 

GCTGGAGCTCTGGTGGCTTTTMGCTGCCTG^ 

CTGATCTTTTGTCACAGCMGMGMGTTTTGGG 

AGMTGTATGTGGGAGGAGTGGMCACAGACAGCTGTTTACATTTAGCCCTAGAAGGAGACTGGGAGTGAGAGCTACA 
AGAAAGGTGGGAATCTATCTGCTGCCTAATAGATGAAAGCTTGGG* 

HCV.2 

MGMQVQIQSLFLLLLWVPGSRGDLMGYIPLVAKFVMWTLKAAALLFLLLADALIFCHSKKKQLFTFSPRRYLVTRHA 
DVYLLPRRGPRLCTCGSSDLYHMWNFISGIFWMHMWNFAKFVMWTLKAMILAGYGAGVYLVAYQATVGVAGALVA 
FKIPFYGKAIRMYVGGVEHRVLVGGVLMFLLI^ARVLPGCSFSIFMFVMWTLKAAAKTSERSQPRRLGVRATRK 
RLIVFPDLGVWMNRLIAFALSAFSLHSYLLFNILGGWVVGIYLLPNR* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAGAGCCTGT7TCTGCTCCTCCTGTGGGTGCCCGGATCCAGA 

GGAGATCTGATGGGATATATCCCTCTGGTGGCTMGTTTGTGGCTGCnGGACACTGMGGCTGCTGCTCTGCTGTTT 

CTGCTGCTGGCTGATGCTCTGATCnCTGTCACAGCMGMGMGCAGCTGTTTACATTTAGCCCAAGAAGATATCTG 

GTGACAAGACACGCTGATGTGTATCTGCTGCCTAGACGCGGACCTAGACTGTGTACATGTGGAAGCTCCGATCTGTAT 

CACATGTGGMCTnATCAGCGGMTCTTTTGGGCTMGCACATGTGGMTTTCATCCTGGCTGGATATGGAG 

GTGTATCTGGTGGCTTATCAGGCTACAGTGGGAGTGGCTGGAGCTCTGGTGGCTTTCMGATCCCAnCTATGGAAAG 

GCTATCAGAATGTATGTGGGAGGAGTGGAACACAGAGTGCTGGTGGGAGGAGTGCTGGCTGCTTTCCTGCTGCTGGCT 

GATGCTAGAGTGCTGCCAGGATGTAGCTTrAGCATCTTCAAGACTTCCGAACGCTCCCAGCCTAGAAGACTGGGAGTG 

AGAGCTACMGGMGAGACTGATCGTGTTTCCAGATCTGGGAGTGTGGATGAATAGACTGATCGCTTTCGCTCTGAGC 

GCTTTCAGCCTGCACAGCTATCTGCTGTTCAACATCCTGGGAGGATGGGTGGTGGGAATCTATCTGCTGCCAAACAGA 

TGAAAGCTT 

HCV.3sl 

MGMQVQIQSLFLLLLWVPGSRGYLVAYQATVAKFVMI^LKAMLLFLLU^DALIFCHSKKKYLVTRHADVLGFGAYM 
SKCTCGSSDLYHMWNFISGI FWAKHMWNF* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGA 

GGATACCTCGTCGCCTACCAGGCCACTGTGGCTAAATTCGTGGCAGCCTGGACACTGAAAGCTGCAGCTGTGCTCTTC 

CTGCTCCTGGCCGATGCACTCATCTTCTGCCATTCCMGAAAAAGTATCTGGTCACCAGACATGCTGACGTGCTGGGG 

TTTGGCGCCTACATGAGCMGTGCACCTGTGGCAGCTC^ 

TGGGCCAAGCACATGTGGAATTTCTGAAAGCTT 
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HCV.3S2 

MGMQVQIQSLFLLLLWVPGSRGVLVGGVUWmMWTLI^ 
NRLIAFAIPFYGKAIVAGALVAFKVGIYLLPNR* 

GMnCGCCGCCACCATGGGMTGCAGGTGCAGATCCAMGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGAT 
CCAGAGGAGTCCTGGTGGGCGGCGTCCTGGCCGCTGCTAAGTTTGTCGCTGCTTGGACACTGAAGGCAGCCGC 
TTTCCTGCTCCTGGCAGACGCCAGGGTGCTGTCTGCCTTCAGCCTCCACTCCTACATCCTCGCAGGGTATGGC 
GCAGGCGTGTGGATGAATCGGCTGATCGCCTTTGCCATTCCATTCTATGGGAAAGCCATTGTGGCTGGCGCCC 
TGGTGGCATTCAAGGTCGGGATCTACCTCCTGCCTAACCGCTGAAAGCTT 

HCV.3s2(-3) 

MGMQVQIQSLFLLLLWVPGSRGVLVGGVUW\KFVAAWTLKAAAFLLLADARVLSAFSLHSYILAGYGAGVWM 
NRLIAFA* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGAT 
CCAGAGGAGTCCTGGTGGGCGGCGTCCTGGCCGCTGCTAAGTTTGTCGCTGCTTGGACACTGAAGGCAGCCGC 
TTTCCTGCTCCTGGCAGACGCCAGGGTGCTGTCTGCCTTCAGCCTCCACTCCTACATCCTCGCAGGGTATGGC 
GCAGGCGTGTGGATGAATCGGCTGATCGCCTTTGCCTGAGGATCC 

HCV.3S3 

MGMQVQIQSLFLLLLWVPGSRGDLMGYIPLVAKFVMWTLKAAARLGVRATRKLLFNILGGWVRMYVGGVEHR 
RLIVFPDLGVGVAGALVAFKLPGCSFSIFKTSERSQPRQLFTFSPRRYLLPRRGPRL 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGAT 
CCAGAGGAGACCTGATGGGCTACATCCCTCTCGTGGCCAAGTTTGTGGCAGCTTGGACCCTGAAGGCCGCTGC 
CAGACTGGGAGTGCGCGCTACACGGAAACTCCTGTTTAACATCCTGGGAGGGTGGGTGCGGATGTACGTCGGA 
GGCGTCGAGCACAGAAGGCTCATTGTCTTTCCAGATCTCGGCGTGGGCGTCGCAGGCGCACTCGTGGCCTTCA 
MCTGCCAGGGTGCAGCnCAGCATTTTCMGACCTCCGMCGCTCCCMCCCAGACAGCTGTTCACTTTCTC 
TCCTCGGAGGTATCTGCTGCCCAGACGCGGACCCAGGCTGTGAAAGCTT 

HCV.PC3 

MGMQVQIQSLFLLLLWVPGSRGLLFNILGGWVKAKFVMl^LKAMUDGGCSGGAYRLIVFPDLGVKFWAKH 
MWNFIGVAGALVAFKKQLFTFSPRR* 

GMTTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGAT 
CCAGAGGACTGCTCTTCAACATCCTGGGCGGATGGGTGAAGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGC 
TGCCGCTCTGGCCGACGGGGGATGCAGCGGCGGAGCnACAGGCTCATTGTCTTTCCCGATCTCGGAGTCAAA 
TTTTGGGCAMGCACATGTGGMTTTCATCGGGGTGGCCGGAGCCCTGGTCGCTTTTAAAMGCAGCTCTTCA 
CCTTCTCCCCAAGACGGTGAGGTACC 
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HCV.PC4 

MGMQVQIQSLFLLLLWVPGSR6RLGVRATRKKAKFVAAWTLKAAAKTSERSQPRNLPGCSFS IFNDLMGY I PL 
VKYLLPRRGPRLNTLCGFADLMGYRMYVGGVEHR* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGAT 
CCAGAGGAAGGCTGGGCGTGAGAGCCACCCGGAAGAAGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGC 
CGCTAAMCMGCGAGCGCTCCCAGCCCAGGAACCTGCCTGGATGCTCTTTCAGCATCTTTAATGACCTCATG 
GGGTACATTCCACTGGTGAAGTATCTGCTCCGCAGACGGGGCCCTCGCCTGAACACTCTCTGTGGATTTGCTG 
ATCTGATGGGGTACAGGATGTATGTCGGCGGAGTCGAACACAGATGAGGTACC 

HCV.243K1P) 

MGMQVQIQSLFLLLLWVPGSRGVLVGGVU\AAFLLUDARVLSAFSLHSYILAGYGAGVWMNRLIAFAGAAAR 
LGVRATRKKAAAKTSERSQPRNLPGCSFSIFNDLMGYIPLVKYLLPRRGPRLNTLCGFADLMGYRMYVGGVEH 
RKLLFNILGGWVKAAALADGGCSGGAYRLIVFPDLGVKFWAKHMWNFIGVAGALVAFKKQLFTFSPRRNGYLV 
AYQATVAAALLFLLLADALIFCHSKKKYLVTRHADVLGFGAYMSKCTCGSSDLYHMWNFISGIFWAKHMWNFK 
AAAAKFVAAWTLKAAA 



GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGCT 
CCAGAGGAGTCCTGGTGGGCGGCGTCCTGGCAGCCGCTTTCCTGCTCCTGGCAGACGCCAGGGTGCTGTCTGC 
CTTCAGCCTCCACTCCTACATCCTCGCAGGGTATGGCGCAGGCGTGTGGATGAATCGGCTGATCGCCTTTGCC 
GGCGCTGCCGCAAGGCTGGGCGTGAGAGCCACCCGeAAGA^ 

CCAGGMCCTGCCTGGATGCTCTTTCAGCATCTTTAATGACCTCATGGGGTACATTCCACTGGTGAAGTATCT 
GCTCCCCAGACGGGGCCCTCGCCTGAACACTCTCTGTGGATTTGCTGATCTGATGGGGTACAGGATGTATGTC 
GGCGGAGTCGAACACAGAAAACTGCTCTTCAACATCCTGGGCGGATGGGTGAAGGCTGCCGCTCTGGCCGACG 
GGGGATGCAGCGGCGGAGCnACAGGCTCAnGTCTTTCCCGATCTCGGAGTCAMTTTTGGGCAAAGCACAT 
GTGGMTTTCATCGGGGTGGCCGGAGCCCTGGTCGCTTTTAAAMGCAGCTCnCACCTTCTCCCCAAGACGG 
AACGGATACCTCGTCGCCTACCAGGCCACTGTGGCTGCAGCTCTGCTCTTCCTGCTCCTGGCCGATGCACTCA 
TCTTCTGCCATTCCMGAAAAAGTATCTGGTCACCAGACATGCTGACGTGCTGGGGTTTGGCGCCTACATGAG 
CMGTGCACCTGTGGCAGCTCCGACCTGTATCACATGTGGMCTTTATTTCTGGMTCTTTTGGGCCAAGCAC 
ATGTGGMTTTTAAGGCCGCAGCAGCTAAATTCGTGGCAGCCTGGACACTGAAAGCAGCTGCATGAGGATCC 
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HCV.4312(1P) 

M6MQVQIQSLFLLLLWVPGSRGRLGVRATRKKAAAKTSERSQPRNLPGCSFSIFNDLMGYIPLVKYLLPRRGPRLNTLC 
GFADLMGYPJWVGGVEHPJ(LLFNILGGWVKAMLADGGCSGG^ 

TFSPRRNGYLVAYQATVAMLLFLLLADALIFCHSKKKYLVTRHADVLGFGAYMSKCTCGSSDLYHMWNFISGIFWAKH 
MWNFKKAMVLVGGVLAMFLLUVDARVLSAFSL^^ 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGCTCCAGAG 
GAAGGCTGGGCGTGAGAGCCACCCGGAAGAAGGCTGCCGCTAAAACAAGCGAGCGCTCCCAGCCCAGGAACCTGCCTGG 
ATGCTCTTTCAGCATCTTTAATGACCTCATGGGGTACATTCCACTGGTGAAGTATCTGCTCCCCAGACGGGGCCCTCGC 
CTGAACACTCTCTGTGGATTTGCTGATCTGATGGGGTACAGGATGTATGTCGGCGGAGTCGAACACAGAAAACTGCTCT 
TCAACATCCTGGGCGGATGGGTGAAGGCTGCCGCTCTGGCCGACGGGGGATGCAGCGGCGGAGCTTACAGGCTCATTGT 
CTnCCCGATCTCGGAGTCAMTTTTGGGCAMGCACATGTGGMTnCATCGGGGTGGCCGGAGCCCTGGTCGCTTTT 
AAAAAGCAGCTCTTCACCTTCTCCCCAAGACGGAACGGATACCTCGTCGCCTACCAGGCCACTGTGGCTGCAGCTCTGC 
TCTTCCTGCTCCTGGCCGATGCACTCATCTTCTGCCATTCCAAGAAAAAGTATCTGGTCACCAGACATGCTGACGTGCT 
GGGGTTTGGCGCCTACATGAGCMGTGCACCTGTGGCAGCTCCGACCTGTATCACATGTGGMCTTTATTTCTGGAATC 
TTnGGGCCMGCACATGTGGMTTTTAAGAAAGCCGCTGCAGTCCTGGTGGGCGGCGTCCTGGCAGCCGCTTTCCTGC 
TCCTGGCAGACGCCAGGGTGCTGTCTGCCTTCAGCCTCCACTCCTACATCCTCGCAGGGTATGGCGCAGGCGTGTGGAT 
GMTCGGCTGATCGCCTTTGCCAATGCTGCAGCTAAATTCGTGGCAGCCTGGACACTGAAAGCAGCTGCATGAGGATCC 

AOSI.K 

MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFVAAWTLKAAAFLPSDFFPSVKFLLSLGIHLYMDDVVLGVGLSR 
YVARLFLLTRILTISTLPETTVVRRQAFTFSPTYKWLSLLVPFV 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGGA 
AGGCCGGMTCCTGTATMGGCCMGTTCGTGGCTGCCTGGACCCTGMGGCTGCCGCTTTCCTGCCTAGCGATTTCTT 
TCCTAGCGTGAAGTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCCAGG 
TACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGGCAGG 
CCTTCACCTTTAGCCCTACCTATAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGTGA 

HBV.l 

MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFVMmKAAAFLPSDFFPSVFLLSLGIHLYMDDVVLGVGLSRY 
VARLFLLTRILTISTLPETTVVRRQAFTFSPTYKWLSLLVPFVIPIPSSWAFTPARVTGGVFKVGNFTGLYLPSDFFPS 
VTLWKAG I LYKNVS I PWTHKLVVDFSQFSRSAICSVVRRALMPLYAC I 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGGA 

AGGCCGGMTCCTGTATMGGCCMGTTCGTGGCTGCCTGGACCCTGMGGCTGCCGCTTTCCTGCCTAGCGATTTCTT 

TCCTAGCGTGTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCCAGGTAC 

GTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGGCAGGCCT 

TCACCTTTAGCCCTACCTATMGTGGCTGAGCCTGCTGGTGCCCTTTGTGATCCCTATCCCTAGCTCCTGGGCTTTCAC 

CCCAGCCAGGGTGACCGGAGGAGTGmMGGTGGGAMCnCACCGGCCTGTATCTGCCCAGCGATTTC 

GTGACCCTGTGGMGGCCGGGATCCTGTACMGMTGTGTCCATCCCTTGGACCCACMGCTGGTGGTGGACTTTTCC 

AGTTCAGCAGATCCGCTATCTGCTCCGTGGTGAGGAGAGCTCTGATGCCACTGTATGCCTGTATCTGA 
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HBV.2 

MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFVAAWTLKAAAFLPSDFFPSVNFLLSLGIHLYMDDVVLGVGLSR 
YVARLFLLTRILTISTLPETTVVRRQAFTFSPT7KGAMWLSLLVPFVNIPIPSSWAFKTPARVTGGVFKVGNFTGLYN 
LPSDFFPSVKTLWKAG I LYKNVS I PtfTHKGAALVVDFSQFSRNSAI CS VVRRALMP LYAC I 

ATGGGMTGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGGA 
AGGCCGIGMTCCTGTATMGGCCMGTTCGTGGCTGCCTGGACCCTGMGGCTGCCGCTnCCTGCCTAGCGATTTCTr 
TCCTAGCGTGAACTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCCAGG 
TACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGGCAGG 
CCnCACCTTTAGCCCTACCTATMGGGAGCCGCTGCCTGGCTGAGCCTGCTGGTGCCCTTTGTGAATATCCCTATCCC 
TAGCTCCTGGGCTnCMGACCCCAGCCAGGGTGACCGGAGGAGTGTTTMGGTGGGAMCTTCACCGGCCTGTATAAC 
CTGCCCAGCGATTTCTTTCCTAGCGTGMGACCCTGTGGMGGCCGGMTCCTGTACMGMTGTGTCCATCCCTTGGA 
CCCACMGGGAGCCGCTCTGGTGGTGGACTTTTCCCAGTTCAGCAGAAATTCCGCTATCTGCTCCGTGGTGAGGAGAGC 
TCTGATGCCACTGTATGCCTGTATCTGA 

PfCTL.l 

MQVQIQSLFLLLLWVPGSRGILSVSSFLFVNAMQTNFKSLLRNLPSENERGYKAMLLACAGU\YKKAAMKFVMWT 
LKAAAKAFMKAVCVEVNAMSFLFVEALFNATPYAGEPAPFKAAAKYKU\TSVLKAGVSENIFLKNAMYFILVNLLIK 
AGLLGVVSTV 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAATCCTGAGCGTGT 

CCTCTnCCTGTTTGTCMCGeCGCTGCACAGACCMTTTCMGAGCCTCCTGAGGMCCTCCCCTCCGAGMCGAAAG 

AGGCTACAAAGCCGCTGCACTGCTCGCCTGCGCTGGACTGGCCTATAAGAAAGCCGCTGCAGCCAAGTTCGTGGCCGCT 

TGGACACTGMGGCCGCTGCAAMGCCTnATGMGGCTGTCTGTGTGGAGGTCMTGCCGCTGCATCTTTCCTGTn 

TGGAGGCCCTCTTTMCGCTACTCCTTACGCAGGGGMCCAGCCCCCTTCAAGGCCGCTGCAAAATATAAGCTGGCAAC 

CAGCGTGCTGMGGCTGGCGTGTCCGAGMTATTTnCTGAAAAACGCCGCTGCATACTTCATCCTGGTGAATCTGCTC 

ATTAAGGCCGGACTCCTGGGGGTGGTCTCTACAGTGTGA 

PfCTL.2 

MQVQIQSLFLLLLWVPGSRGFVEALFQEYNAAAKYLVIVFLINALACAGLAYKKFYFI LVNLLKAALFFI I FNKNAAAK 
FVMWTLKAAAKFILVNLLIFHNFQDEENIGIYKLPYGRTNLKAMVLLGGVGLVLNFLIFFDLFLVKAVLAGLLGVV 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGATTCGTGGAGGCCC 

TGTTTCAGGAATACAACGCCGCTGCAAAGTATCTCGTCATCGTGTFCCTGATCAATGCTCTGGCATGCGCCGGCCTCGC 

TTACAAAMGTTnACTTCATTCTGGTCMCCTGCTCMGGCCGCTCTGnCTTTATCATTTTCM 

GCTMGmGTGGCCGCATGGACCCTGMGGCCGCTGCAAMTTCATCCTCGTGMTCTGCTCATTTn 

MGACGAGGAAMTATCGGMTTTATAAGCTGCCCTACGGGAGGACAAACCTGAAAGCCGCTGCAGTCCTGCTCGGCGG 

AGTGGGGCTGGTGCTCMTmCTGATCnC^ 

GTGTGA 
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PfCTL.3 

MQVQIQSLFLLLLWVPGSRGVFLIFFDLFLNAAAPSDGKCNLYKAAAVTCGNGIQVRKLFHIFDGDNEIKAHVLSHNSY 

EKNYYGKQENWYSLKKILSVFFLANAAAKFIKSLFHIFKAM^ 

AAGLIMVLSFL 

ATGGGMTGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAGTGTTCCTGATCT 

TCTTTGACCTGTTCCTGAACGCCGCTGCACCCAGCGATGGCMGTGCAATCTCTACAAGGCCGCTGCAGTGACCTGTGG 

AMCGGGATTCAGGTCAGGAMCTCTTTCACATCTTCGACGGCGATAACGAGATCAAGGCCCATGTGCTGTCCCACAAT 

TCnATGAAAAAMCTACTATGGAMGCMGAGMTTGGTACAGCCTGMGAAMnCTGTCCGTGTTCTTTCTCGC 

ACGCCGCTGCAMGTTTATCMGTCTCTGTTCCATATTTTCMGGCCGCTGCACTCTACATCAGCTTCTATTTTA 

AGCCAAATTTGTGGCCGCTTGGACACTGAAGGCCGCTGCAAAAGCCGCTGCATACTATATCCCTCACCAGAGCTCCCTG 

AAGGCCGCTGCAGGGCTGATCATGGTGCTCTCTTTCCTGTGA 

PfCTL/HTKN) 

MQVQIQSLFLLLLWVPGSRGSSVFNVVNSSIGLIMVLSFLGPGPGLYISFYFILVNLLIFHINGKIIKNSEGPGPGPDS 
IQDSLKESRKLSGPGPGVLAGLLGVVSTVLLGGVGLVLGPGPGLPSENERGYYIPHQSSLGPGPGQTNFKSLLRNLGVS 
ENIFLKGPGPGFQDEENIGIYGPGPGKYLVIVFLIFFDLFLVGPGPGKFIKSLFHIFDGDNEIGPGPGKSKYKLATSVL 
AGLLGPGPGLPYGKTNLGPGPGRHNWVNHAVPLAMKLIGPGPGMRKLAILSVSSFLFVEALFQEYGPGPGVTCGNGIQV 
RGPGPGMNYYGKQENWYSLKKGPGPGPSDGKCNLYADSAWENVKNVIGPFMKAVCVEVGPGPGKILSVFFLALFFIIFN 
KGPGPGHVLSHNSYEKGPGPGKYKIAGGIAGGLALLACAGLAYKFVVPGAATPYAGEPAPF 

ATGGGMTGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAAGTAGTGTGTTCA 

ATGTTGTGAACTCATCAATTGGTCTGATCATGGTGCTGAGCTTTCTCG 

GGCCAGGGCCAGGATTATATATTTCTTTCTACTC 

AMGMCAGTGAAGGCCCTGGGCCTGGGCCTGACTCGATCCAGGATTCTCTAAAAGAATCGAGGAAGCTCTCCGGACCA 

GGCCCTGGTGTACTCGCCGGGTTGCTGGGAGTAGTTAGCACAGTGCTGTTAGGAGGCGTCGGCCTCGTCTTAGGACCTG 

GACCAGGTCTGCCGTCCGAAAACGAAAGAGGATACTACATACCTCACCAGAGCAGCCTCGGCCCAGGCCCCGGACAAAC 

CMTTTCAMTCCCTCTTGCGAMTCTAGGAGTGAGCGAGMCATATTTCTTAMGGACCCGGTCCCGGCTTTCAGGA^ 

GAGGAGAATATAGGTATTTACGGTCCAGGACCTGGAAAATACCTAGTGATCGTATTCCTAAI 1 1 1 1 1 1 IGACCTATTTC 

TGGTGGGCCCAGGTCCCGGAMGnCAnAMTCACTCnCCACATTTTTGACGGAGATMCGAGATAGGA 

CGGGAAATCAAAGTACAAACTAGCCACTTCAGTGCTGGCCGGCCTTCTAGGGCCGGGCCCAGGGCTCCCCTATGGAAAG 

ACAAATCTTGGCCCCGGTCCAGGACGGCACAACTGGGTGAATCATGCGGTTCCATTGGCCATGAAACTAATCGGGCCCG 

GTCCAGGCATGCGCAMCTTGCMTTCTMGCGTMGTTCATTTCTGTTCGTAGAGGCACTGTTTCAAGAATATGGCCC 

AGGACCTGGCGTCACATGTGGGAATGGGATCCAGGTGAGAGGACCGGGACCTGGTATGAACTATTACGGTAAACAGGAA 

AATTGGTACTCCCTGAAAAAGGGTCCAGGCCCCGGCCCCTCAGATGGTAAGTGCAACCTGTATGCTGACTCAGCATGGG 

AGMCGTAAAAMTGTMTAGGCCCATTCATGMGGCAGTnGTGTCGMGTCGGACCAGGCCCAGGAAAMTACTTTC 

TGTCncnCCTAGCTCTCnCTTCATCA^ 

GAAAMGGGCCAGGACCTGGGAMTACAAMTCGCAGGAGGGATCGCCGGCGGGCTAGCGCTCCTTGCCTGCGCAGGCT 
TGGCnACAMnCGTTGTACCAGGAGCTGCMCACCCTATGCAGGAGMCCTGCCCCATTTTGAAGATCTGC 
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Pf33 

MGMQVQIQSLFLLLLWVPGSRGFMKAVCVEVNVTCGNGIQVRKGLIMVLSFLNMLFHIFDGDNEIKMLLACAGLAYK 

KSFLFVEALFNMPSDGKCNLYKAAQTNFKSLLRNLPSENERGYKAAGVSENIFLKNAAAYFILVNLLIKAAAILSVSS 

FLFVNTPYAGEPAPFKAAAKYKU\TSVLKMVFLIFFDLFLNYYIPHQSSLKMGLLGNVSTVGAVLLGGVGLVLNI_AC 

AGLAYKKAKFIKSLFHIFKMFYFILVNLLKAFLIFFDLFLVKALFFIIFNKNYYGKQENWYSLKFVEALFQEYNAAAK 

FVMWTLKAAAKILSVFFI^AVU\GLLGNVNFQDEENIGIYKAMLYISFYFIKAFILVNLLIFHNM 

AHVLSHNSYEKNAAAKYLV I VFL I 

GCCGCCACCATGGGMTGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGATTTA 
TGAAAGCTGTCTGTGTAGAGGTGAATGTAACATGCGGTAACGGAATTCAGGTGAGAAAGGGACTCATCATGGTACTCAG 
CTTTCTGMCGCAGCCCTGTTCCACATCTTTGACGGAGACAATGAAATCAAAGCCGCATTGCTCGCCTGTGCCGGACTA 
GCCTATAAAMGAGTTTCCTTTTCGTTGMGCACTATTTMCGCAGCACCCAGTGACGGTAMTGCMCCTATATAAAG 
CAGCTCAGACTMTTTCAAMGCCTGTTMGAMTCTGCCCTCAGAGMTGAAAGGGGTTACAAAGCCGCCGGCGTGTC 
CGAGMTATTTTCCTGMGMCGCCGCTGCTTATTTTATACTCGTGMTCTACTCATAMGGCAGCCGCMTCCTTTCA 
GTGTCCAGCTnCTGTTTGTTMCACACCATATGCGGGCGAGCCGGCTCCTTTCMGGCTGCAGCAAAATACAAGCTTG 
CCACATCAGTATTGAMGCAGCTGTGTTTTTGATATTCTTTGATCI 1 1 1 1 1 1 AAACTACTACATACCTCATCAGTCTAG 
TCnAMGCAGCCGGGCTACTGGGGMCGTCTCTACTGTGGGGGCCGTCTTACTTGGAGGAGnGGCCTCGTGTTGAAC 
CTCGCGTGCGCAGGTCTGGCCTACAAAAMGCGAMTTCATCMGTCTCTGTTCCACATTTTTAMGCCGCATTCTATT 
TCATACTAGTGMCCTTCTCAMGCTnCCT^ 

TMCAAAMTTAnACGGCMGCMGAAMTTGGTACTCACTCMGTTTGTAGMGCTCTGTTCCAGGAATACAACGCC 
GCTGCTAMTTCGnGCAGCTTGGACCCTGAMGCAGCTGCAMGATCCTATCGGTCnCTTTCTCGCTAATGCCGTAT 
TAGCAGGACnCTAGGCMCGTGMCTTTCMGACGMGAGMTATAGGCATCTACAMGCCGCAGCACTGTACATTTC 
ATTCTACTTCATCMGGCCTTCATACTGGTCMCCTTCTGATATTTCATAATGCAGCACTGCCATATGGGAGAACCAAC 
TTGAMGCGGCCCACGTGTTGAGCCACAACTCCTACGAGAAGAACGCCGCCGCGAAATATCTCGTCATTGTCTTCCTGA 
TTTGA 



TB.l 

MQVQIQSLFLLLLWVPGSRGRMSR\nTFTVKALVLLMLPVVNLMIGTAMVVKALVLLMLPVGAGLMTAVYLVGAAA 

LLRLPVKRMFAANLGVNSLYFGGICVGRLPLVLPAVNAAAAKFVMWTLKAAAKAAARLMIGTAMGFVVALIPLVNAM 

TYAAPLFVGAAAAMALLRLPLV 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAAGGATGAGCAGAGTGACCA 
CATTCACTGTCAAGGCCCTGGTGCTCCTGATGCTCCCCGTCGTGMCCTGATGATCGGCACCGCTGCAGCCGTCGTGAA 
AGCTCTCGTCCTGCTCATGCTCCCTGTGGGAGCAGGGCTGATGACAGCCGTGTACCTGGTCGGCGCTGCAGCCATGGCC 
CTCCTGCGGCTGCCAGTGMGCGCATGTTTGCTG^ 

GMGGCTGCCCCTCGTGCTGCCTGCTGTGMTGCAGCCGCTGCCAMTTTGTCGCCGCTTGGACTCTGAAGGCAGCCGC 
TMGGCCGCTGCMGACTGATGATCGGGACCGCCGCTGCCGGCTTCGTGGTCGCCCTGATTCCCCTGGTGAACGCCATG 
ACATACGCAGCTCCTCTGTTTGTGGGAGCCGCTGCAGCCATGGCTCTCCTGCGGCTGCCACTGGTGTGA 
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BCL A2 #90 

MQVQIQSLFLLLLWVPGSRGIMI6HLVGVNRLLQETELVNAKVAEIVHFLNAKVFGSLAFVNAYLSGANLNVG 
MYLQLVFGIEVNAAAKFVMI^LKAMKAMVVLGVVFGINSMPPPGTRVN/W\ATVGIMIGVNAKLCPVQL 
WV 

ATGGAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGAATTATGATCGGCC 
ATCTGGTGGGCGTCAACAGACTGCTGCAGGAAACCGAGCTGGTGAATGCCAAGGTGGCCGAAATTGTGCACTT 
TCTCMCGCAMGGTGTTTGGTTCCCTGGCTTTTGTCAATGCCTATCTGAGCGGCGCTAACCTCAACGTCGGA 
GCCGCCTACCTCCAGCTGGTCTTCGGCATCGAGGTCAACGCTGCTGCAAAATTCGTGGCAGCTTGGACCCTCA 
AGGCTGCAGCAAAGGCTGCCGCCGTCGTGCTCGGAGTGGTGTTCGGGATCAACTCTATGCCACCTCCCGGGAC 
TAGGGTCAATGCTGCCGCCGCAACAGTGGGAATCATGATTGGGGTGAATGCCAAACTGTGCCCAGTGCAACTG 
TGGGTGTGA 

BCL A2 #88 

MQVQIQSLFLLLLWVPGSRGVVLGVVFGINAAAAKFVAAWTLKAAAKVAEIVHFLNAYLSGANLNVGAAYLQL 
VFGIEVNIMIGHLVGVNRLLQETELVNAKVFGSLAFVNAKLCPVQLWVNAAAATVGIMIGVNSMPPPGTRV 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGAGTCGTGCTGGGAG 
TCGTCTTCGGCATTAATGCCGCCGCTGCAAAGTTCGTGGCTGCCTGGACCCTGAAGGCCGCAGCTAAAGTGGC 
AGAGATCGTGCACTTTCTGAACGCCTACCTGAGCGGAGCAAATCTGAACGTCGGCGCTGCCTATCTGCAGCTC 
GTGTTTGGAATTGAAGTGAACATCATGATTGGACATCTGGTGGGCGTGAACAGGCTGCTCCAGGAAACTGAGC 
TGGTCAACGCTAAAGTGTTCGGGTCTCTCGCCTTTGTGAACGCTAAGCTCTGCCCCGTCCAACTCTGGGTCAA 
TGCCGCAGCCGCTACAGTGGGGATCATGATCGGCGTGAACTCCATGCCTCCACCAGGGACCAGAGTGTGA 

BCL A2 #63 

MQVQIQSLFLLLLWVPGSRGKLCPVQLWVNAAAATVGIMIGVNIMIGHLVGVNRLLQETELVNAKVAEIVHFL 

NAKVFGSU\FVNAYLSGANLNVGMYLQLVFGIEVNAAAKFVMWTLKAMKAMVVLGVVFGIN 

V 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGAAAGCTCTGCCCCG 
TGCAACTGTGGGTCAACGCCGCCGCCGCAACCGTCGGCATTATGATCGGGGTGAACATCATGATCGGACACCT 
GGTCGGCGTGAACAGGCTGCTGCAGGAGACAGAACTGGTCAATGCCAAGGTGGCTGAAATTGTCCATTTCCTG 
AATGCCAAAGTGTTCGGCTCTCTCGCTTTCGTGAACGCTTATCTGAGCGGAGCTAACCTCAACGTGGGGGCCG 
CATACCTCCAGCTCGTCTTTGGGATTGAGGTGAATGCCGCAGCTAAATTTGTCGCTGCCTGGACCCTGAAGGC 
AGCAGCCAAGGCTGCCGCAGTGGTGCTGGGAGTGGTGTTTGGAATCAATTCCATGCCTCCACCAGGCACTAGA 
GTGTGAGGATCC 
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Prostate 1 

LTFFWLDRSVKAMVLVHPQWVLTVKAAALLQERGVAYIKMLLLSIALSVNPLVCNGVLQGVKMIMYSAH 
TTVKAAAFLTPKKLQC VNAMMNDQLMFLNAGLPS I PVHPVKAAALGTTCYVGAAI LLWQP I P VNFLRPRSLQC 
VKAFLTLS VTW I GVNALLYS LVHN LGAATLMSAMTN L 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGATTGACAI 1 1 1 1 1 1 

GGCTGGATAGATCGGTTAAGGCTGCAGCGGTGCTTGTTCATCCCCAGTGGGTCTTGACCGTAAAGGCTGCCGC 

GCTGCTACAAGAAAGAGGGGTCGCATACATCAAAGCTGCTCTCCTCTTGAGTATTGCGCTAAGTGTAAACCCG 

CTAGTTTGTAATGGGGTGTTACAAGGTGTGAAAGCGGCGATTATGTACAGTGCCCACGACACTACCGTAAAAG 

CAGCCGCTTTCCTGACCCCAAAAAMCTCCMTGCGTGMCGCMTGATGMTGATCAGCTGATGTTTTTAAA 

CGCTGGCnACCnCTATACCGGnCATCCAGTCAAGGCCGCGGCATTGGGTACGACGTGTTATGTTGGAGCA 

GCGATACnCTnGGCAGCCCATACCAGTAMTTTTTTMGACCTAGATCCTTACMTGCGTCAMG 

TTACACTCTCAGTAACTTGGATCGGAGTCAATGCTCTGCTATATAGCCTCGTACACAACTTGGGCGCGGCCAC 

ACTTATGAGTGCAATGACGAATTTAGCTAAGTTCGTGGCGGCCTGGACTCTAAAGGCCGCAGCA 

HIV-1043 

MEKVYLAWVPAHKGIGGGPGPGQKQITKIQNFRVYYRGPGPGWEFVNTPPLVKLWYQGPGPGYRKILRQRKID 
RLIDGPGPGQHLLQLTVWGIKQLQGPGPGGEIYKRWIILGLNKIVRMYGPGPGQGQMVHQAISPRTLNGPGPG 
IKQFINMWQEVGKAMYGPGPGWAGIKQEFGIPYNPQGPGPGKTAVQMAVFIHNFKRGPGPGSPAIFQSSMTKI 
LEPGPGPGEVNIVTDSQYALGIIGPGPGHSNWRAMASDFNLPPGPGPGAETFYVDGAANRETKGPGPGGAVVI 
QDNSDIKVVPGPGPGFRKYTAFTIPSINNE 

ATGGAGAAGGTGTACCTGGCCTGGGTTCCAGCCCACAAAGGCATCGGGGGAGGGCCCGGACCTGGGCAGAAAC 
AGATCACCAAGATCCAGAACTTCCGGGTATACTACCGGGGACCTGGTCCAGGTTGGGAGTTTGTGAACACACC 
ACCCTTAGTAAAGCTCTGGTACCAGGGCCCCGGTCCCGGATACCGTAAAATCCTGAGGCAAAGAAAGATAGAT 
CGCCTCATTGATGGCCCGGGCCCAGGCCAGCACCTTCTGCAGCTTACAGTGTGGGGAATTAAACAGCTGCAGG 
GGCCGGGCCCCGGGGGGGAMTTTATAAAAGGTGGATCATTCTGGGTCTGAACAAGATCGTCCGCATGTATGG 
CCCTGGACCCGGACAGGGGCAGATGGTCCACCAAGCAATCAGCCCTCGAACCTTGAATGGACCGGGCCCAGGA 
ATCAAGCAATTCATTAACATGTGGCAAGAAGTTGGTAAGGCTATGTACGGTCCCGGCCCTGGATGGGCAGGGA 
TAAAACAGGAGTTTGGAATCCCTTACAATCCCCAGGGTCCTGGGCCAGGTAAAACGGCAGTGCAGATGGCCGT 
GTTCAnCATMTTTTMGCGGGGCCCTGGACCTGGCAGCCCAGCTATATTTCAAAGTTCGATGACCAAAATC 
TTGGAGCCCGGCCCAGGGCCGGGCGAAGTGAACATTGTCACAGATTCTCAGTATGCCCTCGGCATCATAGGGC 
CCGGACCAGGGCATTCCAATTGGCGCGCCATGGCGTCTGACTTTAATCTACCTCCTGGGCCAGGCCCTGGCGC 
GGAMCTTTCTATGTGGACGGCGCTGCAAACAGGGAGACTAAGGGACCCGGACCCGGCGGCGCTGTAGTCATT 
CAGGACAACTCAGACATCAAGGTGGTTCCCGGTCCAGGCCCCGGGTTCAGAAAGTATACCGCCTTCACTATTC 
CGTCCATCAACAATGAGTGA 
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HIV- 1043 PADRE 

MEKVYLAWVPAHKGIGGGPGPGQKQITKIQNFRVYYRGPGPGWEFVNTPPLVKLWYQGPGPGYRKILRQRKID 
RLIDGPGPGQHLLQLTVWGIKQLQGPGPGGEIYKRWIILGLNKIVRMYGPGPGQGQMVHQAISPRTLNGPGPG 
IKQFINMWQEVGKAMYGPGPGWAGIKQEFGIPYNPQGPGPGKTAVQMAVFIHNFKRGPGPGSPAIFQSSMTKI 
LEPGPGPGEVNIVTDSQYALGIIGPGPGHSNWRAMASDFNLPPGPGPGAETFYVDGAANRETKGPGPGGAVVI 
QDNSDIKVVPGPGPGFRKYTAFTIPSINNEGPGPGAKFVMWTLKAAA 

ATGGAGAAGGTGTACCTGGCCTGGGTTCCAGCCCACAAAGGCATCGGGGGAGGGCCCGGACCTGGGCAGAAAC 
AGATCACCMGATCCAGAACTTCCGGGTATACTACCGGGGACCTGGTCCAGGTTGGGAGTTTGTGAACACACC 
ACCCTTAGTAAAGCTCTGGTACCAGGGCCCCGGTCCCGGATACCGTAAAATCCTGAGGCAAAGAAAGATAGAT 
CGCCTCATTGATGGCCCGGGCCCAGGCCAGCACCTTCTGCAGCTTACAGTGTGGGGAATTAAACAGCTGCAGG 
GGCCGGGCCCCGGGGGGGAAATTTATAAAAGGTGGATCATTCTGGGTCTGAACAAGATCGTCCGCATGTATGG 
CCCTGGACCCGGACAGGGGCAGATGGTCCACGAAGCAATCAGCCCTCGAACCTTGAATGGACCGGGCCCAGGA 
ATCAAGCAATTCATTAACATGTGGCAAGAAGTTGGTAAGGCTATGTACGGTCCCGGCCCTGGATGGGCAGGGA 
TAAAACAGGAGTTTGGAATCCCTTACAATCCCCAGGGTCCTGGGCCAGGTAAAACGGCAGTGCAGATGGCCGT 
GnCATTCATMTTTTMGCGGGGCCCTGGACCTGGCAGCCCAGCTATATTTCAAAGTTCGATGACCAAAATC 
TTGGAGCCCGGCCCAGGGCCGGGCGAAGTGAACATTGTCACAGATTCTCAGTATGCCCTCGGCATCATAGGGC 
CCGGACCAGGGCATTCCAATTGGCGCGCCATGGCGTCTGACTTTAATCTACCTCCTGGGCCAGGCCCTGGCGC 
GGAAACTTTCTATGTGGACGGCGCTGCAAACAGGGAGACTAAGGGACCCGGACCCGGCGGCGCTGTAGTCATT 
CAGGACAACTCAGACATCAAGGTGGTTCCCGGTCCAGGCCCCGGGTTCAGAAAGTATACCGCCTTCACTATTC 
CGTCCATCAACAATGAGGGCCCCGGCCCAGGTGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTTG 
A 

HIV 75mer 

EKVYLAWVPAHKGIGGPGPGQGQMVHQAISPRTLNGPGPGSPAIFQSSMTKILEPGPGPGFRKYTAFTIPSIN 
NE 

GAGAAGGTGTACCTGGCCTGGGTGCCTGCCCACAAGGGAATCGGAGGACCTGGCCCTGGACAGGGACAGATGG 
TGCACCAGGCCATCAGCCCTAGGACCCTGAACGGACCTGGACCTGGAAGCCCTGCCATCTTCCAGAGCAGCAT 
GACCAAGATCCTGGAGCCCGGACCTGGACCTGGATTCAGGAAGTACACCGCCTTCACCATCCCCAGCATCAAC 
AACGAGTGA 
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PfHTL 

MQVQIQSLFLLLLWVPGSRGRHNWVNHAVPLAMKLIGPGPGKCNLYADSAWENVKNGPGPGKSKYKLATSVL 
AGLLGPGPGQTNFKSLLRNLGVSEGPGPGSSVFNVVNSSIGLIMGPGPGVKNVIGPFMKAVCVEGPGPGMNY 
YGKQENWYSLKKGPGPGGLAYKFVVPGAATPYGPGPGPDSIQDSLKESRKLNGPGPGLLIFHINGKIIKNSE 
GPGPGAGLLGNVSTVLLGGVGPGPGKYKIAGGIAGGLALLGPGPGMRKLAILSVSSFLFV 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAAGGCAC 
AACTGGGTGAATCATGCTGTGCCCCTGGCTATGAAGCTGATCGGCCCTGGACCAGGGAAATGCAACCTCTAC 
GCAGACAGCGCCTGGGAGAACGTCAAGAATGGCCCCGGACCTGGGAAATCCAAGTATAAGCTCGCTACCTCT 
GTGCTGGCAGGCCTGCTCGGACCAGGCCCCGGACAGACAAATTTCAAAAGCCTGCTCAGAAACCTGGGAGTG 
TCCGAGGGGCCTGGCCCAGGATCTAGCGTCTTTAATGTGGTCAACTCCTCTATTGGGCTCATCATGGGACCC 
GGACCTGGGGTGAAAAATGTCATTGGCCCATTCATGAAGGCCGTGTGTGTCGAAGGACCCGGGCCTGGCATG 
AACTACTATGGAAAGCAAGAAAATTGGTACAGCCTGAAGAAAGGCCCTGGGCCAGGCGGACTGGCTTACAAG 
TTTGTGGTCCCAGGGGCAGCCACTCCCTATGGGCCTGGGCCAGGCCCCGATTCCATCCAGGACTCTCTCAAA 
GAGAGCCGGAMCTGMCGGACCCGGGCCTGGACTGCTCATTTTCCACATCAATGGCAAAATTATCAAGAAC 
AGCGAGGGACCTGGGCCAGGCGCCGGACTGCTGGGGAACGTGTCCACCGTCCTGCTCGGCGGAGTGGGGCCC 
GGCCCTGGGAAGTACAAGATCGCTGGAGGGATCGCAGGCGGACTGGCCCTCCTGGGCCCAGGACCAGGGATG 
CGCAAACTGGCTATTCTCTCTGTCTCCAGCTTTCTGTTTGTGTGA 
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Sequence 


1 1 

Restriction 


HI 
n j 


L V 


yay 


386 


VLAEAMSQV 


HLA-A2 


H] 


[V 




271 


MTNNPPIPV 


HLA-A2 


HI 


[V 


pol 


774 


MASDFNLPPV 


HLA-A2 


H] 


[V 


pol 


448 


KLVGKLNWA 


HLA-A2 


H 


:v 


pol 


163 


LVGPTPVNI 


HLA-A2 


H 


:v 


DOl 


498 


ILKEPVHGV 


HLA-A2 


H] 


:v 


pol 


879 


KAACWWAGI 


HLA-A2 


H] 


[V 


pol. 


132 


KMIGGIGGFI 


HLA-A2 


H 


:v 


pol 


772 


RAMASDFNL 


HLA-A2 


H] 


[V 


no! 


183 


TLNFPISPI 


HLA-A2 


H] 


[V 

L ¥ 


env 

d 1 V 


134 


KLTPLCVTL 


HLA-A2 


H] 


[V 

L V 


pnv 


651 


LLQLTVWGI 


HLA-A2 


H] 


[V 


pnv 


163 


SLLNATDIAV 


HLA-A2 


HI 


fV 

L V 


npf 


221 


LTFGWCFKL 


HLA-A2 


HI 


L V 


vnr 

V yJl 


59 


AIIRILQQL 


HLA-A2 


H] 


L V 


vnr 

v yj i 


62 


RILQQLLFI 


HLA-A2 


HI 
1 1 j 


L V 


nnl 


929 


QMAVFIHNFK 


HLA-A3 


H] 
1 1 . 




nol 


722 


KVYLAWVPAHK 


HLA-A3 


HI 


» V 


nnl 


971 


KIQNFRVYYR 


HLA-A3 


HI 


. V 


nnl 


347 


AIFQSSMTK 


HLA-A3 


HI 


[V 

L V 


nnl 


98 


VTIKIGGQLK 


HLA-A3 


H] 


[V 

L ¥ 


env 

Vl 1 V 


61 


TTLFCASDAK 


HLA-A3 


HI 


[V 

L V 


1 1 I V 


47 


VTVYYGVPVWK 


HLA-A3 


HI 


[V 

L V 


npf 

1 IvJ 1 


100 


QVPLRPMTYK 


HLA-A3 


HI 
1 1 j 


L V 


vif 

V l 1 


7 


VMIVWQVDR 


HLA-A3 


H] 


. V 


nan 
yay 


162 


QMVHQAISPR 


HLA-A3 


H] 


[V 

L V 


yay 


545 


YPLASLRSLF 


HLA-B7 


HI 


► V 


yay 


237 


HPVHAGPIA 


HLA-B7 


HI 


[V 

L V 


nnl 


186 


FPISPIETV 


HLA-B7 


HI 




nnl 


893 


IPYNPQSQGVV 


HLA-B7 


H] 


[V 


pnv 

^1 1 V 


259 


IPIHYCAPA 


HLA-B7 


HI 


. V 


pnv 

CI 1 V 


250 


CPKVSFEPI 


HLA-B7 


HI 


tv 


1 IC 1 


94 


FPVRPQVPL 


HLA-B7 - 


H] 


[V 


tpv 

1 C V 


75 


VPLQLPPL 


HLA-B7 


H] 


. V 


nnl 


684 


EVNIVTDSQY 


HLA-A1 


HI 


[V 


yay 


317 


FRDYVDRFY 


HLA-A1 


H] 


[V 


pol 


368 


VIYQYMDDLY 


HLA-A1 


H] 


[V 


pol 


295 


VTVLDVGDAY 


HLA-A1 


H] 


[V 


pol 


533 


IYQEPFKNL 


HLA-A24 


H] 


[V 


pol 


244 


PYNTPVFAI 


HLA-A24 


H] 


[V 


pol 


530 


TYQIYQEPF 


HLA-A24 


H] 


[V 


pol 


597 


YWQATVIIPEW 


HLA-A24 


H] 


[V 


env 


681 


IWGCSGKLI 


HLA-A24 


H] 


[V 


env 


671 


RYLKDQQLL 


HLA-A24 
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protein 


Sequence 


Restriction 


1 IT1# - r~ r- 

HIV env 55 


% tt ft / a™ 1 M\ ■^^•■^a P" 

VWKEATTTLF 


HLA-A24 


UTU im M AC 

HIV vpr 46 


t \/r*T\/A ati i 

IYETYGDTW 


HLA-A24 


UTW « mM i a 

HIV vpr 14 


A\/k i r~i m r* i 

PYNEWTLEL 


HLA-A24 


UT\/ onn 

HIV gag 298 


l/AI IT T 1 Al III/ T uniiv/ 

KRWIILGLNKIVRMY 


HLA-DR 


HIV pol 596 


WEFVNTPPLVKLWYQ 


HLA-DR 


UTW r%j^T ftrr 

HIV pol 956 


AI/atti/ TAiirnw\/\/n 

QKQITKIQNFRVYYR 


HLA-DR 


UTW r-v/O "71 O 

HIV pol 712 


l/WWI Al l\/n A t ll/ATAA 

KVYLAWVPAHKGIGG 


HLA-DR 


HIV gag 294 


AFTWI/m ITTI Al klt/T 

GEIYKRWIILGLNKI 


HLA-DR 
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HBV-2 



MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFVAAWTLKAAAFLPSDFFPSVNFLLSLGIHLYMDDVVL6VGLS 
RYVARLFLLTRI LTISTLPETTVVRRQAFTFSPTYKGAAAWLSLLVPFVNI P I PSSWAFKTPARVTGGVFKVGNFTGL 
YNLPSDFFPSVKTLWKAGILYKNVSIPWTHKGAALVVDFSQFSRNSAICSVVRRALMPLYACI 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGG 
MGGCCGGMTCCTGTATMGGCCMGTTCGTGGCTGCCTGGACCCTGMGGCTGCCGCTTTCCTGCCTAGCGATTTC 
TTTCCTAGCGTGAACTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCC 
AGGTACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGG 
CAGGCCTTCACCTTTAGCCCTACCTATAAGGGAGCCGCTGCCTGGCTGAGCCTGCTGGTGCCCTTTGTGAATATCCCT 
ATCCCTAGCTCCTGGGCTnCMGACCCCAGCCAGGGTGACCGGAGGAGTGTTTAAGGTGGGAAACTTCACCGGCCTG 
TATMCCTGCCCAGCGATnCTTTCCTAGCGTGAAGACCCTGTGGAAGGCCGGAATCCTGTACAAGAATGTGTCCATC 
CCTTGGACCCACMGGGAGCCGCTCTGGTGGTGGACTTnCCCAGTTCAGCAGAAATTCCGCTATCTGCTCCGTGGTG 
AGGAGAGCTCTGATGCCACTGTATGCCTGTATCTGA 



FIG.20D 



HBV-2A 



MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFVAAWTLKAAAFLPSDFFPSVNFLLSLGIHLYMDDVVLGVGLS 
RYVARLFLLTRILTISTLPETTVVRRQAFTFSPTYKGAAAWLSLLVPFVNIPIPSSWAFKTPARVTGGVFKVGNFTGL 
YNLPSDFFPSVKTLWKAGILYKNVSIPMTTIKGAALVVDFSQFSRNSAICSVVRRKAWMMWYWGPSLYKKYTSFPWLLN 
AHPAAMPHLLKAAADLLDTASALYNAAARFSWLSLLVPFNAASWPKFAVPNLKLTFGRETVLEYKALSLDVSAAFYGA 
AEYLVSFGVWGAALMPLYACI 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGG 
MGGCCGGMTCCTGTATMGGCCMGTTCGTGGCTGCCTGGACCCTGMGGCTGCCGCTTTCCTGCCTAGCGATTTC 
TTTCCTAGCGTGAAC7TCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCC 
AGGTACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGG 
CAGGCCTTCACCTTTAGCCCTACCTATAAGGGAGCCGCTGCCTGGCTGAGCCTGCTGGTGCCCTTTGTGAATATCCCT 
ATCCCTAGCTCCTGGGCTnCMGACCCCAGCCAGGGTGACCGGAGGAGTGTTTAAGGTGGGAAACTTCACCGGCCTG 
TATMCCTGCCCAGCGATTTCTTTCCTAGCGTGAAGACCCTGTGGAAGGCCGGAATCCTGTACAAGAATGTGTCCATC 
CCnGGACCCACMGGGAGCCGCTCTGGTGGTGGACTTTTCCCAGTTCAGCAGAAATAGCGCCATCTGTTCGGTCGTG 
AGAAGGAAAGCCTGGATGATGTGGTACTGGGGTCCTAGTCTGTATAAGAAGTACACCTCATTCCCATGGCTCTTGAAT 
GCCCATCCCGCTGCAATGCCACACCTGCTTAAAGCTGCGGCGGATCTGCTGGACACAGCCTCAGCTTTATATAATGCT 
GCAGCMGATTCTCCTGGTTGTCTCTCTTAGTGCCCnCMCGCAGCnCCTGGCCAAMTnGCCGTTCCGAACCTG 
MGCTCACTTnGGMGAGAGACAGTACnGMTACAMGCACTMGCCTTGACGTGTGAGCAGCCTTCTACGGAGCA 
GCAGMTATCTAGTATCTTTTGGGGTCTGGGGCGCAGCCCTCATGCCTCTATACGCCTGCATTTGA 



FIG.20E 
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HBV-2B 

MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFVMWTLKAAAFLPSDFFPSVNFLLSL6IHLYMDDVVL 
GVGLSRYVARLFLLTRILTISTLPETTVVRRQAFTFSPTYKGAAAWLSLLVPFVNIPIPSSWAFKTPARVTGG 
VFKVGNFTGLYNLPSDFFPSVKTLWKAGILYKNVSIPWTHKGAALVVDFSQFSRNSAICSVVRRKEYLVSFGV 
WGLSLDVSAAFYNAAAKYTSFPWLLNAHPAAMPHLLKAAADLLDTASALYNSWPKFAVPNLKLTFGRETVLEY 
KAAWMMWYWGPSLYKAAARFSWLSLLVPFGAAALMPLYACI 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCC 
TGTGGAAGGCCGGAATCCTGTATAAGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTTTCCTGCC 
TAGCGATTTCTTTCCTAGCGTGAACTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTG 
GGAGTGGGACTGTCCAGGTACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAG 
AGACCACCGTGGTGAGGAGGCAGGCCTTCACCTTTAGCCCTACCTATAAGGGAGCCGCTGCCTGGCTGAGCCT 
GCTGGTGCCCTTTGTGAATATCCCTATCCCTAGCTCCTGGGCTTTCAAGACCCCAGCCAGGGTGACCGGAGGA 
GTGTTTMGGTGGGAMCnCACCGGCCTGTATMCCTGCCCAGCGATTTCTTTCCTAGCGTGAAGACCCTGT 
GGAAGGCCGGAATCCTGTACAAGAATGTGTCCATCCCTTGGACCCACAAGGGAGCCGCTCTGGTGGTGGACTT 
nCCCAGTTCAGCAGAMTTCAGCMTnGnCGGTGGTGAGMGAMGGMTATCnGTnCATTTGGCGTC 
TGGGGGCTGTCACTGGATGTMGTGCGGCATTTTACAATGCCGCCGCAAAATATACAAGCTTCCCATGGCTCC 
TAAACGCACACCCAGCTGCAATGCCGCATCTACTGAAAGCAGCCGCTGACCTCTTAGACACTGCCTCCGCTCT 
GTACMCTCTTGGCCCMGTnGCCGTGCCTAATCTCAAGTTGACCTTCGGTAGAGAGACAGTCTTAGAATAC 
AAAGCGGCCTGGATGATGTGGTACTGGGGACCCTCTCTGTATAAAGCCGCTGCAAGGTTCTCCTGGCTTAGCC 
TTCTCGTACCATTCGGAGCAGCTGCCCTAATGCCTTTGTACGCATGCATCTGA 



FIG.20F 
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HBV-21A 

MGMQVQIQSLFLLLLWVPGSRGSWPKFAVPNLKAAAAKFVMWTLKAAAKSTLPETTVVRRKHPAAMPHLLKAA/^ 
WKAGILYKKAFLLTRILTIGALSLDVSMFYNAAAKYTSFPWLLNAAARFSWLSLLVPFNMTPARVTGGVFKMEY 
VSFGVWGAMYMDDVVLGVNDLLDTASALYNAMFPHCLAFSYMKAM^ 
GIHLNIPIPSSWAFKAAWLSLLVPFVNAFLPSDFFPSVKLTFGRETVLEYKQAFTFSPTYK 

ATGGGMTGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGATCTTGGCCTAAA 

TTCGCAGTGCCAAACCTTAAAGCCGCGGCTGCTAAGTTCGTAGCTGCCTGGACACTAAAGGCCGCCGCTAAGAGCACA 

CTGCCAGAGACCACCGTGGTCCGGCGAAAGCATCCAGCCGCAATGCCCCACTTGCTCAAAGCAGCCGCCCACACTCTT 

TGGMGGCTGGGATAnGTACMGAMGCCnCCnCTGACCAGGATAnMCTATCGGAGCTCTGTCACTCGACGTT 

TCTGCTGCCTTCTACMCGCGGCGGCAAMTACACTAGCTTTCCATGGCTACTCMCGCAGCCGCCAGATTTTCTTGG 

CTATCACTACTGGTGCCAmMTGCAGCMCACCTGCTAGAGTGACTGGCGGCGTCmAMGCAGCCGAGTAC 

GTGAGCTTTGGCGTCTGGGGTGCAGCGGCATATATGGATGATGTAGTGTTAGGGGTGAACGACCTCCTGGACACAGCC 

AGTGCGCTGTACMTGCAGCTGCATTCCCGCATTGCCTAGCCTTCAGTTATATGAAAGCAGCAGCCTGGATGATGTGG 

TACTGGGGACCGTCCCTTTATAMGCAGCTTCAGCMTCTC^ 

GGTATTCACCTGMCAnCCCATCCCTTCCTCATGGGCAnCAMGCCGCTTGGCTGAGTCTACTCGTACCTTTCGTT 

MTGCATnCTGCCCAGCGACTTTTTCCCCTCGGTAAMCTGACATTCGGACGCGAMCAGTCCnGMTATM 

GCCTTCACGTTCTCACCAACCTATAAATGA 

FIG.21D 



HBV-21B 



MGMQVQIQSLFLLLLWVPGSRGYMDDVVLGVNAAAEYLVSFGVWNDLLDTASALYGAAHTLWKAGILYKKAFLPSDFF 
PSVKAFPHCLAFSYMKMRFSWLSLLVPFNMSWPKFAVPNLKAMQAFTFSPTYKNAAASAICSVVRRKAFLLTRIL 
TINIPIPSSWAFKMWMMWYWGPSLYKAMTPARVTGGVFKAANFLLSLGIHLNLTFGRETVLEYKHPAAMP 
STLPETTVVRRKWLSLLVPFVNAAAAKFVMWTLKAAAKLSLDVSAAFYNAAAKYTSFPWLL 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGATACATGGATGAC 
GTTGTGnAGGCGTTMTGCAGCCGCAGMTATCTCGTGTCAnCGGCGTCTGGMCGACCTGTTGGACACTGCATCT 
GCTCTGTACGGTGCAGCCCATACCCTGTGGMGGCCGGMTCCTCTACAAAMGGCAnCCTACCTAGCGACTTTTTT 
CCnCAGTGAMGCCTTCCCACATTGCCTAGCAnCTCGTATATGAMGCGGCTAGGTTCTCATGGCTTAGTCTTCTA 
GTACCTTTCMTGCCGCCTCCTGGCCCAMTTCGCCGTACCAMTCTAAMGCGGCCGCGCAGGCCTTTACATTCTCT 
CCGACnATAAAMTGCAGCAGCCTCCGCTAmGTAGCGTCGTGCGCCGAMGGCCnCCTGCTMCCCGGATTTTG 
ACGATAAACATCCCCATCCCnCTAGCTGGGCTnCAMGCAGCATGGATGATGTGGTACTGGGGTCCCAGCTTATAC 
AMGCTGCGGCMCCCCAGCMGAGTGACAGGGGGCGTGTTTAAGGCCGCCMCTTCCTCCTGAGTCTCGGAATACAC 
CTGMCTTMCCTTTGGGAGAGAGACAGTACTGGAGTATAAACACCCAGCAGCTATGCCGCACCTACTCAAAGCCGCT 
TCMCACTCCCAGAMCMCTGTAGTGAGGAGAAMTGGCTC^ 

MGmGTGGCCGCnGGACACnMGGCTGCAGCAAAGnGTCACnGATGnAGTGCAGCGnCTATAACGCAGCT 
GCAAAATACACTTCCTTTCCCTGGCTGCTGTGA 
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HB\)-30B 

MGMQVQIQSLFLLLLWVPGSRGFLLTRILTINAMSWPKFAVPNLKAMHTLWKAGILYKKADLLDTASALYNQAn"FS 

PTYKGAAANVSIPWTHKGAMFLLSLGIHLNIPIPSSWAFKAAALWFHISCLTFKAAAILLLCLIFLLNAAAYPALMPL 

YACINAHPAAMPHLLKAMSFCGSPYKAAGLSRYVARLNKYTSFPWLLNFLPSDFFPSVKAFPHCLAFSYMKAEYLVSF 

GVWNMLTFGRETVLEYKAAALPSDFFPSVKAYMDDVVLGVNLVVDFSQFSRNAAARWMCLRRFIINAARFSWLSLLVP 

FNMTPARVTGGVFKMWLSLLVPFVNSAICSVVRRKAKFVMWTLKAAAKWMMWYWGPSLYKMSTLPETW 

LDVSAAFY 

ATGGGMTGCAGGTCCAGATACAGAGCTTGTTCCTCCTCCTGCTTTGGGTCCCCGGATCAAGGGGT7TCCTCCTAACCC 
GCATCCTGACAATTAACGCCGCAGCCTCCTGGCCAAAATTTGCCGTGCCAAATCTCAAGGCAGCTGCACACACACTATG 
GAMGCAGGGATACTGTACMGAMGCCGATCTGCTAGACACAGCGTCTGCGTTGTACMCCAGGCTTTTACTTTCTCT 
CCTACATATAAAGGCGCAGCTGCAAACGTGAGTATCCCTTGGACGCACAAAGGAGCCGCTGCCAACTTCTTACTGTCCC 
TGGGCATCCATCTAMTATCCCTATTCCTTCATCCTGGGCATTTAAAGCAGCCGCCTTATGGTTCCACATAAGTTGTCT 
GACCTTCAMGCCGCAGCMTCCTGCTCCTnGCCTCATTTTCTTACTAAACGCCGCTGCCTATCCAGCTCTTATGCCA 
TTGTACGCATGTATCMCGCCCACCCCGCAGCMTGCCCCACCTCCTTAMGCTGCCGCCAGTTTCTGCGGTTCTCCTT 
ATAMGCAGCAGGGCTGTCCAGATACGTAGCTAGGCTAMCMGTATACCAGCTTCCCCTGGTTACTTAATTTCCTGCC 
GTCAGATTTCTTTCCATCAGTTMGGCCTTCCCTCATTGTCTGGCCTTTAGCTACATGMGGCTGMTATTTGGTATCC 
TTCGGCGTGTGGAATGCGGCACTGACATTTGGAAGGGAGACAGTGCTCGAGTACAAAGCCGCCGCACTACCCTCGGACT 
TCTTCCCATCGGTCAMGCTTACATGGACGATGTAGTCCTCGGCGTTMCTTAGTAGTGGACTTTTCTCMTTTTCCAG 
AMCGCAGCGGCCAGATGGATGTGCCTTCGGCGTTTTATAATAAACGCCGCTCGATTCAGCTGGCTATCACTCCTAGTT 
CCATTTMTGCAGCTACACCCGCACGGGTGACAGGTGGAGTTTTCMGGCAGCGTGGCTTTCACTGCTTGTGCCATTTG 
TGAACTCAGCTATTTGCTCAGTAGTGAGAAGGAAGGCAAAATTCGTCGCTGCCTGGACTCTCAAAGCTGCCGCAAAGTG 
GATGATGTGGTATTGGGGACCGAGCTTGTACAAAGCGGCCTCTACTCTGCCAGAAACTACCGTAGTGAGAAGAAAACTG 
AGCCTGGACGTCAGCGCGGCATTCTACTGA 

FIG.22D 



HBV-30C 



MGMQVQIQSLFLLLLWVPGSRGFLLSLGIHLNAAAKYTSFPWLLNAAARFSWLSLLVPFNAAFPHCLAFSYMKAALVVD 

FSQFSRGAILLLCLIFLLNAAAKTLWKAGILYKKAWMMWYWGPSLYKAYPALMPLYACIGAAAWLSLLVPFVNFLLTRI 

LTINIPIPSSWAFKAAAEYLVSFGVWNLPSDFFPSVKFLPSDFFPSVKDLLDTASALYNSWPKFAVPNLKAAASAICSV 

VRRKLSLDVSMFYNAAAKFVMWTLKAAAKMNVSIPWTHKGMGLSRYVARLNAAASTLPETT^ 

KAAARWMCLRRFIINASFCGSPYKMYMDDVVLGVI^LWFHISCLTFKAMTPARVTGGVFKAMLTFGRETVLEYKQA 

FTFSPTYK 

ATGGGMTGCAGGTGCAMTACAGTCTCTCmCTmGCTTCTCTGGGnCCAGGATCACGGGGCnCTTGCmGCT 
TGGGCATCCACCTAMTGCTGCTGCAAMTACACATCTTTTCCTTGGCTCCTTMTGCCGCCGCTAGGTTTTCATGGCT 
GAGTCTGCTAGTACCTTTCMTGCGGCTTTCCCACAnGCCTAGCTTTTAGCTATATGAAAGCTGCTTTAGTCGTGGAC 
TTTTCACAGTTTAGCAGAGGAGCAATCCTGCTGCTATGTCTGATATTCCTTCTAAACGCAGCAGCCCACACACTCTGGA 
AAGCTGGTATCCTTTACAAGAAAGCCTGGATGATGTGGTATTGGGGACCCAGCCTCTACAAAGCATACCCTGCCCTGAT 
GCCACTATACGCATGCATTGGCGCGGCAGCCTGGTTATCCCTTrrAGTACCGTTTGTCMCTTTCTATTAACCAGAATC 
CTGACGATTMTATTCCGATCCCMGnCCTGGGCATTCAAAGCAGCCGCGGAGTATCTGGTTTCATTTGGCGTATGGA 
ACCTGCCMGCGACnCTTTCCnCTGTTMGTTCCTCCCCTCCGATnCTnCCATCGGTGAMGACCTCCTTGATAC 
CGCGAGCGCTCTGTACMCTCGTGGCCAAMnCGCAGTTCCAMCCTAAMGCCGCCGCCAGTGCCATTTGTTCCGTG 
GTMGGAGAAMTTATCACTCGACGTGTCCGCAGCATTTTATAACGCTGCTGCAAAGTTTGTCGCAGCATGGACATTGA 
AGGCTGCAGCGAAAGCAGCAAATGTATCAATACCCTGGACCCACAAGGGTGCAGCCGGGCTGTCTAGGTATGTGGCGAG 
GCTAAACGCCGCCGCCTCAACACTGCCTGAGACTACTGTCGTGAGACGCAAACACCCTGCCGCAATGCCCCACCTGCTG 
AMGCAGCCGCACGATGGATGTGCCTCAGMGATTCATMTAAACGCTTCTTTCTGTGGGTCACCCTACAAAGCCGCTT 
ACATGGACGATGTGGTCCTCGGAGTGAATGCCCTCTGGTTCCATATCAGCTGCCTGACATTCAAGGCAGCCGCCACCCC 
CGCTCGTGTGACAGGAGGTGTCnCAMGCCGCGGCACTGACTTTCGGTCGGGAAACTGTATTGGAATATAAGCAGGCC 
TTCACATTCTCCCCAACATACAAGTGA 
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HBV-CL 

MQVQIQSLFLLLLWVPGSRGFLLSLGIHLNAAAKYTSFPWLLNAAARFSWLSLLVPFNAAFPHCLAFSYMKA 

ALVVDFSQFSRGAILLLCLIFLLNAAAHTLWKAGILYKKAWMMWYWGPSLYKAYPALMPLYACIGAAAWLSL 

LVPFVNFLLTRILTINAAAIPIPSSWAFKAMEYLVSFGVWNLPSDFFPSVKAMFLPSDFFPSVKAAADLL 

DTASALYNSWPKFAVPNLKAMSAICSVVRRKLSLDVSMFYNAAAKFVMWTLKAMKMNV 

AGLSRYVARLNAAASTLPEnVVRRKHPAAMPHLLKAAARWMCLRRFIINASFCGSPYKAAYMDDVVLGVNA 

LWFHISCLTFKAAATPARVTGGVFKAAALTFGRETVLEYKQAFTFSPTYK 

ATGGGMTGCAGGTGCAMTACAGTCTCTCTTCCTTTTGCTTCTCTGGGnCCAGGATCACGGGGCTTCTTG 
CnAGCnGGGCATCCACCTAMTGCTGCTGCAAMTACACATCTTTTCCTTGGCTCCTTAATGCCGCCGCT 
AGGTTTTCATGGCTGAGTCTGCTAGTACCmCMTGCGGCmCCCACATTGCCTAGCTTTTAGCTATATG 
AMGCTGCTTTAGTCGTGGACTTTTCACAGTTTAGCAGAGGAGCMTCCTGCTGCTATGTCTGATATTCCTT 
CTAAACGCAGCAGCCCACACACTCTGGAAAGCTGGTATCCTTTACAAGAAAGCCTGGATGATGTGGTATTGG 
GGACCCAGCCTCTACAAAGCATACCCTGCCCTGATGCCACTATACGCATGCATTGGCGCGGCAGCCTGGTTA 
TCCCTTTTAGTACCGTnGTCMCTnCTATTMCCAGMTCCT.GACGAnMTGCTGCCGCCATTCCGATC 
CCMGTTCCTGGGCAnCAMGCAGCCGCGGAGTATCTGGTTTCATTTGGCGTATGGAACCTGCCAAGCGAC 
nCTTTCCnCTGnMGGCCGCTGCTnCCTCCCCTCCGATTTCTTTCCATCGGTGAAAGCCGCTGCCGAC 
CTCCTTGATACCGCGAGCGCTCTGTACAACTCGTGGCCAAAATTCGCAGTTCCAAACCTAAAAGCCGCCGCC 
AGTGCCATTTGnCCGTGGTMGGAGAAMnATCACTCGACGTGTCCGCAGCATTTTATAACGCTGCTGCA 
MGTTTGTCGCAGCATGGACATTGAAGGCTGCAGCGAAAGCAGCAAATGTATCAATACCCTGGACCCACAAG 
GGTGCAGCCGGGCTGTCTAGGTATGTGGCGAGGCTAAACGCCGCCGCCTCAACACTGCCTGAGACTACTGTC 
GTGAGACGCAAACACCCTGCCGCAATGCCCCACCTGCTGAAAGCAGCCGCACGATGGATGTGCCTCAGAAGA 
nCATMTAAACGCTTCTTTCTGTGGGTCACCCTACAAAGCCGCTTACATGGACGATGTGGTCCTCGGAGTG 
AATGCCCTCTGGTTCCATATCAGCTGCCTGACATTCAAGGCAGCCGCCACCCCCGCTCGTGTGACAGGAGGT 
GTCnCAMGCCGCGGCACTGACTTTCGGTCGGGAAACTGTATTGGAATATAAGCAGGCCTTCACATTCTCC 
CCAACATACAAGTGA 



FIG.23C 
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HBV-HTL 

MGTSFVYVPSALNPADGPGPGLCQVFADATPTGWGLGPGPGRHYLHTLWKAGILYKGPGPGPHHTALRQAILC 
WGELMTLAGPGPGESRLVVDFSQFSRGNGPGPGPFLLAQFTSAICSVVGPGPGLVPFVQWFVGLSPTVGPGPG 
LHLYSHPIILGFRKIGPGPGSSNLSWLSLDVSAAFGPGPGLQSLTNLLSSNLSWLGPGPGAGFFLLTRILTIP 
QSGPGPG VS FGVW I RTP PAYRPPNAP I GPGPGVGPLTVNEKRRLKL I GPGPGKQCFRKLPVNRP I DWGPGPGA 
ANWILRGTSFVYVPGPGPGKQAFTFSPTYKAFLCGPGPGAKFVAAWTLKAAA 

ATGGGMCnCTTTTGTGTATGTCCCTTCCGCTCTGAACCCAGCAGACGGACCCGGGCCTGGCCTGTGCCAGG 
TCTTCGCCGACGCAACTCCCACAGGGTGGGGGCTGGGGCCAGGACCAGGCAGGCACTACCTGCATACTCTGTG 
GAAGGCAGGAATCCTCTATAAAGGGCCCGGCCCAGGCCCTCACCACACCGCCCTGAGGCAGGCCATCCTGTGC 
TGGGGGGAGCTCATGACCCTGGCCGGACCTGGACCCGGGGAGAGCAGACTGGTGGTGGATTTCAGCCAATTCA 
GCAGAGGAMCGGACCCGGCCCTGGGCCTTTTCTGCTGGCTCAGTnACATCTGCTATTTGTTCTGTGGTCGG 
CCCCGGGCCCGGACTCGTGCCTTTCGTGCAGTGGTTCGTGGGACTGTCCCCTACAGTCGGGCCCGGCCCAGGG 
CTGCATCTGTACTCCCACCCAATCATCCTCGGCTTCCGCAAGATTGGACCCGGCCCAGGCTCCAGCAATCTCT 
CCTGGCTCTCTCTGGACGTGTCTGCCGCCTTTGGCCCTGGACCAGGCCTGCAAAGCCTGACTAATCTGCTCAG 
eAGCMCCTGTCCTGGCTGGGACCTGGCCCAGGGGCTGGCTTCTTTCTGCTCACCCGGATTCTCACAATTCCC 
CAGTCCGGACCAGGACCAGGAGTCAGTTTCGGGGTGTGGATCAGGACCCCTCCTGCTTATAGACCACCCAATG 
CTCCAATCGGCCCCGGCCCTGGCGTCGGGCCACTGACCGTGAATGAGAAGCGCCGGCTGAAGCTGATCGGCCC 
TGGCCCTGGCAAGCAGTGCTTTCGCAAACTGCCCGTGAACAGACCTATTGATTGGGGCCCCGGCCCTGGAGCA 
GCCMCTGGATTCTCAGGGGMCMGCnCGTCTACGTGCCCGGGCCCGGACCAGGGMGCAGGCTTTTACCT 
TCTCTCCCACTTACAAGGCCTTCCTCTGTGGGCCAGGCCCCGGCGCCAAGTTTGTGGCAGCATGGACCCTCAA 
AGCCGCTGCCTGA 



FIG.24C 
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